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PROTEIN-PROTEIN 
INTERACTIONS IN Helicobacter pylori 

FIELD OF THE INVENTION 

5 The present invention relates to proteins that interact with Helicobacter 

pylori. More specifically, the present invention relates to complexes of 
polypeptides or polynucleotides encoding the polypeptides, fragments of the 
polypeptides, antibodies to the complexes, Selected Interacting Domains (SID®) 
which are identified due to the protein-protein interactions, methods for screening 

10 drugs for agents which modulate the interaction of proteins and pharmaceutical 
compositions that are capable of modulating the protein-protein interactions. 

In another embodiment the present invention provides a protein-protein 
interaction map called a PIM® which is available in a report relating to the 
protein-protein interactions of Helicobacter pylori. 

15 BACKGROUND AND PRIOR ART 

Most biological processes involve specific protein-protein interactions. 
Protein-protein interactions enable two or more proteins to associate. A large 
number of non-covalent bonds form between the proteins when two protein 
surfaces are precisely matched. These bonds account for the specificity of 

20 recognition. Thus, protein-protein interactions are involved, for example, in the 
assembly of enzyme subunits, in antibody-antigen recognition, in the formation of 
biochemical complexes, in the correct folding of proteins, in the metabolism of 
proteins, in the transport of proteins, in the localization of proteins, in protein 
turnover, in first translation modifications, in the core structures of viruses and in 

25 signal transduction. 

General methodologies to identify interacting proteins or to study these 
interactions have been developed. Among these methods are the two-hybrid 
system originally developed by Fields and co-workers and described, for 



WO 02/066501 PCT/EP01/15428 

2 

example, in U.S. Patent Nos. 5,283,173, 5,468,614 and 5,667,973, which are 
hereby incorporated by reference. 

The earliest and simplest two-hybrid system, which acted as basis for 
development of other versions, is an in vivo assay between two specifically 
5 constructed proteins. The first protein, known in the art as the "bait protein" is a 
chimeric protein which binds to a site on DNA upstream of a reporter gene by 
means of a DNA-binding domain or BD. Commonly, the binding domain is the 
DNA-binding domain from either Gal4 or native E. coli LexA and the sites placed 
upstream of the reporter are Gal4 binding sites or LexA operators, respectively. 

10 The second protein is also a chimeric protein known as the "prey" in the 

art. This second chimeric protein carries an activation domain or AD. This 
activation domain is typically derived from Gal4, from VP 16 or from B42. 

Besides the two hybrid systems, other improved systems have been 
developed to detect protein-protein interactions. For example, a two-hybrid plus 

15 one system was developed that allows the use of two proteins as bait to screen 
available cDNA libraries to detect a third partner. This method permits the 
detection between proteins that are part of a larger protein complex such as the 
RNA polymerase II holoenzyme and the TFIIH or TFIID complexes. Therefore, 
this method, in general, permits the detection of ternary complex formation as 

20 well as inhibitors preventing the interaction between the two previously defined 
fused proteins. 

Another advantage of the two-hybrid plus one system is that it allows or 
prevents the formation of the transcriptional activator since the third partner can 
be expressed from a conditional promoter such as the methionine-repressed 
25 Met25 promoter which is positively regulated in medium lacking methionine. The 
presence of the methionine-regulated promoter provides an excellent control to 
evaluate the activation or inhibition properties of the third partner due to its "on" 
and "off switch for the formation of the transcriptional activator. The three-hybrid 
method is described, for example in Tirade et a!., The Journal of Biological 
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Chemistry, 272, No. 37 pp. 22995-22999 (1997) incorporated herein by 
reference. 

Besides the two and two-hybrid plus one systems, yet another variant is 
that described in Vidal et al, Proc. Natl. Sci. 93 pgs. 10315-10320 called the 
5 reverse two-and one-hybrid systems where a collection of molecules can be 
screened that inhibit a specific protein-protein or protein/DNA interaction, 
respectively. 

A summary of the available methodologies for detecting protein-protein 
interactions is described in Vidal and Legrain, Nucleic Acids Research Vol. 27, 
10 No. 4 pgs. 91 9-929 (1999) and Legrain and Selig, FEBS Letters 480 pgs. 32-36 
(2000) which references are incorporated herein by reference. 

However, the above conventionally used approaches and especially the 
commonly used two-hybrid methods have their drawbacks. For example, it is 
known in the art that, more often than not, false positives and false negatives 

15 exist in the screening method. In fact, a doctrine has been developed in this field 
for interpreting the results and in common practice an additional technique such 
as co-immunoprecipitation or gradient sedimentation of the putative interactors 
from the appropriate cell or tissue type are generally performed. The methods 
used for interpreting the results are described by Brent and Finley, Jr. in Ann. 

20 Rev. Genet, 31 pgs. 663-704 (1997). Thus, the data interpretation is very 
questionable using the conventional systems. 

One method to overcome the difficulties encountered with the methods in 
the prior art is described in WO 99/42612, incorporated herein by reference. This 
method is similar to the two-hybrid system described in the prior art in that it also 
25 uses bait and prey polypeptides. However, the difference with this method is that 
a step of mating at least one first haploid recombinant yeast cell containing the 
prey polypeptide to be assayed with a second haploid recombinant yeast cell 
containing the bait polynucleotide is performed. Of course the person skilled in 
the art would appreciate that either the first recombinant yeast cell or the second 
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recombinant yeast cell also contains at least one detectable reporter gene that is 
activated by a polypeptide including a transcriptional activation domain. 

The method described in W099/42612 permits the screening of more prey 
polynucleotides with a given bait polynucleotide in a single step than in the prior 
5 art systems due to the cell to cell mating strategy between haploid yeast cells. 
Furthermore, this method is more thorough and reproducible, as well as sensitive. 
Thus, the presence of false negatives and/or false positives is extremely minimal 
as compared to the conventional prior art methods. 

One of the prokaryotic microorganisms studied by the inventors is 
io Helicobacter pylori. Helicobacter pylori (H. pylori) is a microaerophilic, Gram 
negative, slow growing, spiral shaped and flagellated organism. H. pylori has 
been first isolated in 1983 from a gastric biopsy specimen of patient with chronic 
gastritis (Marshall et al„ 1984, Lancet, 1:1311-1314, Unidentified curved bacilli in 
the stomach of patients with gastritis and peptic ulceration). 

15 Helicobacter pylori has become identified as a primary cause of chronic 

gastroduodenal disorders, such as gastritis, dyspepsia, and peptic ulcers, in 
humans. Studies have shown (Labigne et al.) that H. pylori can be successfully 
eradicated by a treatment combining two antibiotics with a proton pump inhibitor. 
However, few antibiotics are active against H. pylori and antibiotic-resistant 

20 strains have begun to appear. 

The H. pylori strain n° 26695 genome has been studied by Tomb et al. 
(Tomb et al., 1997, Nature, vol. 388, 539-547, The complete genome sequence 
of the gastric pathogen Helicobacter pylori). This strain's genome consists of a 
circular chromosome with a size of 1,667,867 bp, average G + C content of 39%, 
25 and 1590 predicted coding sequences (open reading frames or "ORF"). 

The availability of the entire genome sequence of two clinical strains, 26695 
and J99 (Aim et al, 1999) (Tomb et al, 1997) has encouraged more global 
approaches to the functional analysis of the whole set of genes. In H. pylori, 
there is still as much as 42% of the H. pylori encoded proteins for which there is a 
30 need to assign biological function. Attempts to classify genes within functional 
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categories such as genes essential for viability, or conditionally essential in a 
given environment have been proposed by Akerley and collaborators (Akerley et 
al, 1998). This concept was first applied to H. pylori by Chalker et al. (Chalker et 
al, 2001) who analyzed for essentiality a set of genes selected by bioinformatic 
5 genome prioritization. 

The bacterial factors necessary for colonization of the gastric environment, 
and for virulence of this pathogen, are poorly understood. Examples of known 
virulence factors are: 

- Enzymes involved in neutralizing the acid gastric pH : the multisubunit 
10 urease is a characteristic enzyme that is crucial for survival in acidic pH 

and for successful colonization of the gastric environment, a site that few 
other microbes can colonize (Labigne et al., WO 93/07273, Helicobacter 
pylori genes necessary for the regulation and maturation of urease, and 
use thereof). Genes encoding ureases have been located on a 34 kb 
15 chromosome fragment and comprise ureA, ureB, ureC, ureD, ureE, ureF, 

ureG, ureH and urel. 

- Bacterial flagellar proteins responsible for motility across the mucous layer 
(Hazell et al., 1986, J. Inf. Dis., 153, 658-663 Campylobacter pyloridis and 
gastritis: association with intracellular spaces and adaptation to an 

20 environment of mucus as important factors in colonization of the gastric 

epithelium; Leying et al., 1992, Mol. Microbiol., 6, 2863-2874 Cloning and 
genetic characterization of Helicobacter pylori flagellin gene): flagellar 
filaments biosynthesis comprises A and B flagellins and the filament cap. 
These two biosyntheses are regulated by flbA gene (Suerbaum et al., 

25 French patent application, 1995, n° 2,736,360, Cloning and 

characterization of flbA gene of Helicobacter pylori, aflagellated strains 
production). 

- Two other essential toxins for virulence are VacA and CagA. 

VacA is a H. pylori toxin that induces the formation of large acidic vacuoles 
30 in host epithelial cells. These large vacuoles originate from massive swelling of 
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membranous compartments of late stages of the endocytic pathway (de Bernard 
et al., 1997, Microbiology, 26(4), 665-674, Helicobacter pylori toxin VacA induces 
vacuole formation by acting in the cell cytosol) Proof for receptor-mediated 
interaction with VacA has been made by Pagliaccia et al.; m2 allele of vacA gene 
has always been described as inactive in the in vitro HeLa cell assay, however, 
the m2 allele is associated with peptic ulcer and is prevalent in populations in 
which peptic ulcer and gastric cancer have high incidence (Pagliaccia et al., Proc. 
Natl. Acad. Sci. U.S.A, 1998, 95(17), 10212-10217, The m2 form of the 
Helicobacter pylori cytotoxin has cell type-specific vacuolating activity). 

CagA is one of the proteins encoded by the "cag pathogenicity island" 
(Spohn et al. 1997, Molecular Microbiology, 26(2), 361 -372,. Transcriptional 
analysis of the divergent cagAB genes encoded by the pathogenicity island of 
Helicobacter pylori) found in H. pylori strains isolated from most patients with 
peptic ulcer disease and adenocarcinoma. CagA is produced by 50-60% of H. 
pylori strains; it is a high molecular weight (120-140 kDa) superficial protein and 
an immunodominant antigen with unknown function. H. pylori strains that produce 
CagA protein have two genes cagB and cagC (36 and 101 kDa proteins, 
respectively). These genes are highly associated with duodenal ulcers (Blaser et 
al. 1996, WO 96/12825, cagB and cagC genes of Helicobacter pylori and related 
methods an d compositions). 

Other virulence factors are : several gastric tissue-specific adhesins (Boren 
et al., 1993, Science, 262, 1892-1895). 

Therapeutic agents are currently available that eradicate H. pylori infections 
in vitro. However, methods employing antibiotic agents result in the emergence of 
bacterial strains which are resistant to these agents. 

Thus, it is an object of the present invention to identify protein-protein 
interactions for Helicobacter pylori. 

It is another object of the present invention to identify protein-protein 
interactions of Helicobacter pylori for the development of more effective and 
better targeted therapeutic applications. 
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It is yet another object of the present invention to identify complexes of 
polypeptides or polynucleotides encoding the polypeptides and fragments of the 
polypeptides of Helicobacter pylori. 

It is yet another object of the present invention to identify antibodies to 
these complexes of polypeptides or polynucleotides encoding the polypeptides 
and fragments of the polypeptides of Helicobacter pylori including polyclonal, as 
well as monoclonal antibodies that are used for detection. 

It is still another object of the present invention to identify selected 
interacting domains of the polypeptides, called SID® polypeptides. 

It is still another object of the present invention to identify selected 
interacting domains of the polynucleotides, called SID® polynucleotides. 

It is another object of the present invention to generate protein-protein 
interactions maps called PIM®s. 

It is yet another object of the present invention to classify genes of H. 
pylori into functional categories at the genomic scale such as genes essential or 
nonessential for viability. 

It is yet another object of the present invention to identify Putative 
Essential Genes (PEGS) from H. pylori, as well as the true essential genes. 

It is yet another object of the present invention to establish a large scale 
protein-protein interaction map of H. pylori using the two-hybrid system as a way 
to elucidate the function of yet uncharacterized proteins. 

It is yet another object of the present invention to identify a superbinder 
phenotype in H. pylori with the two-hybrid system which completely inhibits 
specific protein-protein interactions. 

It is yet another object of the present invention to identify oligopeptides. 
Their overlapping or combining derivatives that inhibit H. pylori growth. 



WO 02/066501 PCT/EP01/15428 

8 

in yet another aspect, the present invention relates to the identification of 
ORFs (open reading frames) having enzymatic activity, which provides a direct 
way to screen lead compounds that abolish enzymatic activity through the 
disruption of the oligomeric interaction. 

5 It is yet another object of the present invention to provide a method for 

screening drugs for agents which modulate the interaction of proteins and 
pharmaceutical compositions that are capable of modulating the protein-protein 
interactions of Helicobacter pylori. 

it is another object to administer the nucleic acids of the present invention 
10 via gene therapy. 

It is yet another object of the present invention to provide protein chips or 
protein microarrays. 

It is yet another object of he present invention to provide a report in, for 
example paper, electronic and/or digital forms, concerning the protein-protein 
15 interactions, the modulating compounds and the like, as well as a PIM®. 

These and other objects are achieved by the present invention as 
evidenced by the summary of the invention, description of the preferred 
embodiments and the claims. 

SUMMARY OF THE PRESENT INVENTION 

20 Thus the present invention relates to a protein complex of polypeptides as 

described in Table 1. 

Furthermore, the present invention provides SID® polynucleotides and 
SID® polypeptides as defined in Figure 2, as well as a PIM® for Helicobacter 
pylori. 

25 The present invention also provides antibodies to the protein-protein 

complexes for Helicobacter pylori. 
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In another embodiment the present invention provides a method for 
screening drugs for agents that modulate the protein-protein interactions and 
pharmaceutical compositions that are capable of modulating protein-protein 
interactions. 

In another embodiment the present invention provides protein chips or 
protein microarrays. 

In another embodiment the present invention identifies a superbinder 
phenotype in H. pylori with the two-hybrid system which completely inhibits 
protein-protein interactions. 

In another aspect the present invention provides oligopeptides, their 
overlapping or combining derivatives thereof that inhibit H. pylori growth. 

In yet another embodiment, the present invention identifies ORFs having 
enzymatic activity which provides a direct way to screen lead compounds. 

In yet another embodiment the present invention provides a report in, for 
example, paper, electronic and/or digital forms. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic representation of the pB1 plasmid. 
Fig. 2 is a schematic representation of the pB5 plasmid. 
Fig. 3 is a schematic representation of the pB6 plasmid. 
Fig. 4 is a schematic representation of the pB13 plasmid. 
Fig. 5 is a schematic representation of the pB14 plasmid. 
Fig. 6 is a schematic representation of the pB20 plasmid. 
Fig. 7 is a schematic representation of the pP1 plasmid. 
Fig. 8 is a schematic representation of the pP2 plasmid. 
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Fig. 9 is a schematic representation of the pP3 plasmid. 

Fig. 10 is a schematic representation of the pP6 plasmid. 

Fig. 11 is a schematic representation of the pP7 plasmid. 

Fig. 12 is a schematic representation of vectors expressing the T25 
5 fragment. 

Fig. 13 is a schematic representation of vectors expressing the T18 
fragment. 

Fig. 14 is a schematic representation of various vectors of pCmAHL.1, 
pT25andpT18. 

10 Fig. 15 is a schematic representation identifying the SID®'s of 

Helicobacter pylori. In this figure the "Full-length prey protein" is the Open 
Reading Frame (ORF) or coding sequence (CDS) where the identified 
prey polypeptides are included. The Selected Interaction Domain (SID®) is 
determined by the commonly shared polypeptide domain of every selected 

15 prey fragment. 

Fig. 16 is a protein map (PIM®). 

Fig. 17 is a gel illustrating the results obtained for the disruption of the 
ORFs hp0099 to hp0198. This figure exemplifies first that multiple 
insertions of the transposon took place and second that for the majority 
20 trransposon insertion occurred at a distance ranging between 100 to 600 

bp from the 5'-end of the ORF, a distance compatible with the promotion 
of gene replacement by allelic recombination. 

Fig. 18 is a schematic diagram of the procedure for classification of the 
genes as described in the present invention. 

25 Fig. 19 are the results of three-hybrid experiments. Growth phenotypes of 

diploid strains containing various plasmids were analyzed by incubating 
cells at various dilutions (from 1 to 10~ 4 ). Yeast growth was performed 
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over 2 days at 30°C on DO-3+Met or DO-3-Met medium. Lane 1 are cells 
containing [p3H1-HP1230]+pP6-HP1529; Lane 2 [p3H1-HP1230- 
SID1529WT]+pP6-HP1529; lane 3[pH1-HP1230-SID1529(N38D- 
V53L)+pP6-HP1529; lane 4 [p3H1-HP1230-SID1529(V53L)]+pP6- 
5 HP1529; and lane 5 [p3H1-HP1230]+pP6-HP0875. 

Fig. 20 is the pP7-centro vector. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

As used herein the terms "polynucleotides", "nucleic acids" and 
10 "oligonucleotides" are used interchangeably and include, but are not limited to 
RNA, DNA, RNA/DNA sequences of more than- one nucleotide in either single 
chain or duplex form. The polynucleotide sequences of the present invention may 
be prepared from any known method including, but not limited to, any synthetic 
method, any recombinant method, any ex vivo generation method and the like, as 
15 well as combinations thereof. 

The term "polypeptide" means herein a polymer of amino acids having no 
specific length. Thus, peptides, oligopeptides and proteins are included in the 
definition of "polypeptide" and these terms are used interchangeably throughout 
the specification, as well as in the claims. The term "polypeptide" does not 
20 exclude posttranslational modifications such as polypeptides having covalent 
attachment of glycosyl groups, acetyl groups, phosphate groups, lipid groups and 
the like. Also encompassed by this definition of "polypeptide" are homologs 
thereof. 

By the term "homologs" is meant structurally similar genes contained 
25 within a given species, orthologs are functionally equivalent genes from a given 
species or strain, as determined for example, in a standard complementation 
assay. Thus, a polypeptide of interest can be used not only as a model for 
identifying similar genes in given strains, but also to identify homologs and 
orthologs of the polypeptide of interest in other species. The orthologs, for 
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example, can also be identified in a conventional complementation assay. In 
addition or alternatively, such orthologs can be expected to exist in bacteria (or 
other kind of cells) in the same branch of the phylogenic tree, as set forth, for 
example, at ftp://ft.cme.msu.edu/pub/rdp/SSU-rRNA/SSU/Prok.phvlo . 

As used herein the term "prey polynucleotide" means a chimeric 
polynucleotide encoding a polypeptide comprising (i) a specific domain; and (ii) a 
polypeptide that is to be tested for interaction with a bait polypeptide. The specific 
domain is preferably a transcriptional activating domain. 

As used herein, a "bait polynucleotide" is a chimeric polynucleotide 
encoding a chimeric polypeptide comprising (i) a complementary domain; and (ii) 
a polypeptide that is to be tested for interaction with at least one prey polypeptide. 
The complementary domain is preferably a DNA^binding domain that recognizes 
a binding site that is further detected and is contained in the host organism. 

As used herein "complementary domain" is meant a functional constitution 
of the activity when bait and prey are interacting; for example, enzymatic activity. 

As used herein "specific domain" is meant a functional interacting 
activation domain that may work through different mechanisms by interacting 
directly or indirectly through intermediary proteins with RNA polymerase II or Ill- 
associated proteins in the vicinity of the transcription start site. 

As used herein the term "complementary" means that, for example, each 
base of a first polynucleotide is paired with the complementary base of a second 
polynucleotide whose orientation is reversed. The complementary bases are A 
and T (or A and U) or C and G. 

The term "sequence identity" refers to the identity between two peptides or 
between two nucleic acids. Identity between sequences can be determined by 
comparing a position in each of the sequences which may be aligned for the 
purposes of comparison. When a position in the compared sequences is 
occupied by the same base or amino acid, then the sequences are identical at 
that position. A degree of sequence identity between nucleic acid sequences is a 
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function of the number of identical nucleotides at positions shared by these 
sequences. A degree of identity between amino acid sequences is a function of 
the number of identical amino acid sequences that are shared between these 
sequences. Since two polypeptides may each (i) comprise a sequence (i.e., a 
portion of a complete polynucleotide sequence) that is similar between two 
polynucleotides, and (ii) may further comprise a sequence that is divergent 
between two polynucleotides, sequence identity comparisons between two or 
more polynucleotides over a "comparison window" refers to the conceptual 
segment of at least 20 contiguous nucleotide positions wherein a polynucleotide 
sequence may be compared to a reference nucleotide sequence of at least 20 
contiguous nucleotides and wherein the portion of the polynucleotide sequence in 
the comparison window may comprise additions or deletions (i.e., gaps) of 20 
percent or less compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences. 

To determine the percent identity of two amino acids sequences or two 
nucleic acid sequences, the sequences are aligned for optimal comparison. For 
example, gaps can be introduced in the sequence of a first amino acid sequence 
or a first nucleic acid sequence for optimal alignment with the second amino acid 
sequence or second nucleic acid sequence. The amino acrid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are 
then compared. When a position in the first sequence is occupied by the same 
amino acid residue or nucleotide as the corresponding position in the second 
sequence, the molecules are identical at that position. 

The percent identity between the two sequences is a function of the 
number of identical positions shared by the sequences. Hence % identity = 
number of identical positions / total number of overlapping positions X 100. 

In this comparison the sequences can be the same length or may be 
different in length. Optimal alignment of sequences for determining a comparison 
window may be conducted by the local homology algorithm of Smith and 
Waterman (J. Theor. Biol, 91 (2) pgs. 370-380 (1981), by the homology 
alignment algorithm of Needleman and Wunsch, J. Miol. Biol, 48(3) pgs. 443-453 
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(1972), by the search for similarity via the method of Pearson and Lipman, PNAS, 
USA, 85(5) pgs. 2444-2448 (1988), by computerized implementations of these 
algorithms (GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics 
Software Package Release 7.0, Genetic Computer Group, 575, Science Drive, 
Madison, Wisconsin) or by inspection. 

The best alignment (i.e., resulting in the highest percentage of identity over 
the comparison window) generated by the various methods is selected. 

The term "sequence identity" means that two polynucleotide sequences 
are identical (i.e., on a nucleotide by nucleotide basis) over the window of 
comparison. The term "percentage of sequence identity" is calculated by 
comparing two optimally aligned sequences over the window of comparison, 
determining the number of positions at which the identical nucleic acid base (e.g., 
A, T, C, G, U, or I) occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of 
positions in the window of comparison (i.e., the window size) and multiplying the 
result by 100 to yield the percentage of sequence identity. The same process can 
be applied to polypeptide sequences. 

The percentage of sequence identity of a nucleic acid sequence or an 
amino acid sequence can also be calculated using BLAST software (Version 2.06 
of September 1998) with the default or user defined parameter. 

The term "sequence similarity" means that amino acids can be modified 
while retaining the same function. It is known that amino acids are classified 
according to the nature of their side groups and some amino acids such as the 
basic amino acids can be interchanged for one another while their basic function 
is maintained. 

The term "isolated" as used herein means that a biological material such 
as a nucleic acid or protein has been removed from its original environment in 
which it is naturally present. For example, a polynucleotide present in a plant, 
mammal or animal is present in its natural state and is not considered to be 
isolated. The same polynucleotide separated from the adjacent nucleic acid 



WO 02/066501 



15 



PCT/EP01/15428 



sequences in which it is naturally inserted in the genome of the plant or animal is 
considered as being "isolated." 

The term "isolated" is not meant to exclude artificial or synthetic mixtures 
with other compounds, or the presence of impurities which do not interfere with 
5 the biological activity and which may be present, for example, due to incomplete 
purification, addition of stabilizers or mixtures with pharmaceutical^ acceptable 
excipients and the like. 

"Isolated polypeptide" or "isolated protein" as used herein means a 
polypeptide or protein which is substantially free of those compounds that are 
10 normally associated with the polypeptide or protein in a naturally state such as 
other proteins or polypeptides, nucleic acids, carbohydrates, lipids and the like. 

The term "purified" as used herein means at least one order of magnitude 
of purification is achieved, preferably two or three orders of magnitude, most 
preferably four or five orders of magnitude of purification of the starting material 
is or of the natural material. Thus, the term "purified" as utilized herein does not 
mean that the material is 100% purified and thus excludes any other material. 

The term "variants" when referring to, for example, polynucleotides 
encoding a polypeptide variant of a given reference polypeptide are 
polynucleotides that differ from the reference polypeptide but generally maintain 

20 their functional characteristics of the reference polypeptide. A variant of a 
polynucleotide may be a naturally occurring allelic variant or it may be a variant 
that is known naturally not to occur. Such non-naturally occurring variants of the 
reference polynucleotide can be made by, for example, mutagenesis techniques, 
including those mutagenesis techniques that are applied to polynucleotides, cells 

25 or organisms. 

Generally, differences are limited so that the nucleotide sequences of the 
reference and variant are closely similar overall and, in many regions identical. 

Variants of polynucleotides according to the present invention include, but 
are not limited to, nucleotide sequences which are at least 95% identical after 
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alignment to the reference polynucleotide encoding the reference polypeptide. 
These variants can also have 96%, 97%, 98% and 99.999% sequence identity to 
the reference polynucleotide. 

Nucleotide changes present in a variant polynucleotide may be silent, 
which means that these changes do not alter the amino acid sequences encoded 
by the reference polynucleotide. 

Substitutions, additions and/or deletions can involve one or more nucleic 
acids. Alterations can produce conservative or non-conservative amino acid 
substitutions, deletions and/or additions. 

Variants of a prey or a SID® polypeptide encoded, by a variant 
polynucleotide can possess a higher affinity of binding and/or a higher specificity 
of binding to its protein or polypeptide counterpart, against which it has been 
initially selected. In another context, variants can also loose their ability to bind to 
their protein or polypeptide counterpart. 

By "anabolic pathway" is meant a reaction or series of reactions in a 
metabolic pathway that synthesize complex molecules from simpler ones, usually 
requiring the input of energy. An anabolic pathway is the opposite of a catabolic 
pathway. 

As used herein, a "catabolic pathway" is a series of reactions in a 
metabolic pathway that break down complex compounds into simpler ones, 
usually releasing energy in the process. A catabolic pathway is the opposite of an 
anabolic pathway. 

As used herein, "drug metabolism" is meant the study of how drugs are 
processed and broken down by the body. Drug metabolism can involve the study 
of enzymes that break down drugs, the study of how different drugs interact 
within the body and how diet and other ingested compounds affect the way the 
body processes drugs. 

As used herein, "metabolism" means the sum of all of the enzyme- 
catalyzed reactions in living cells that transform organic molecules. 
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By "secondary metabolism" is meant pathways producing specialized 
metabolic products that are not found in every cell. 

As used herein, "SID®" means a Selected Interacting Domain and is 
identified as follows: for each bait polypeptide screened, selected prey 
polypeptides are compared. Overlapping fragments in the same ORF or CDS 
define the selected interacting domain. 

As used herein the term "PIM®" means a protein-protein interaction map. 
This map is obtained from data acquired from a number of separate screens 
using different bait polypeptides and is designed to map out all of the interactions 
between the polypeptides. 

The term "affinity of binding", as used herein, can be defined as the affinity 
constant Ka when a given SID® polypeptide of the present invention which binds 
to a polypeptide and is the following mathematical relationship: 

[SID® /polypeptide complex] 

Ka = 

[free SID®] [free polypeptide] 

wherein [free SID®], [free polypeptide] and [SID® polypeptide complex] consist 
of the concentrations at equilibrium respectively of the free SID® polypeptide, of 
the free polypeptide onto which the SID® polypeptide binds and of the complex 
formed between SID® polypeptide and the polypeptide onto which said SID® 
polypeptide specifically binds. 

The affinity of a SID® polypeptide of the present invention or a variant 
thereof for its polypeptide counterpart can be assessed for example, on a 
Biacore™ apparatus marketed by Amersham Pharmacia Biotech Company such 
as described by Szabo et al Curr Opin Struct Biol 5 pgs. 699-705 (1995) and by 
Edwards and Leartherbarrow, Anal. Biochem 246 pgs. 1-6 (1997). 
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As used herein the phrase "at least the same affinity" with respect to the 
binding affinity between a SID® polypeptide of the present invention to another 
polypeptide means that the Ka is identical or can be at least two-fold, at least 
three fold or at least five fold greater than the Ka value of reference. 

As used herein, the term "modulating compound" means a compound that 
inhibits or stimulates or can act on another protein which can inhibit or stimulate 
the protein-protein interaction of a complex of two polypeptides or the protein- 
protein interaction of two polypeptides. 

More specifically, the present invention comprises complexes of 
polypeptides or polynucleotides encoding the polypeptides composed of a bait 
polypeptide, or a bait polynucleotide encoding a bait polypeptide and a prey 
polypeptide or a prey polynucleotide encoding a prey polypeptide. The prey 
polypeptide or prey polynucleotide encoding the prey polypeptide is capable of 
interacting with a bait polypeptide of interest in various hybrid systems. 

As described in the Background of the present invention there are various 
methods known in the art to identify prey polypeptides that interact with bait 
polypeptides of interest. These methods, include, but are not limited to, generic 
two- hybrid systems as described by Fields et al in Nature, 340:245-246 (1989) 
and more specifically in U.S. Patent Nos. 5,283,173, 5,468,614 and 5,667,973, 
which are hereby incorporated by reference; the reverse two-hybrid system 
described by Vidal et al, supra; the two plus one hybrid method described, for 
example, in Tirade et al, supra; the yeast forward and reverse 'n'-hybrid systems 
as described in Vidal and Legrain, supra; the method described in WO 99/42612; 
those methods described in Legrain et al FEBS Letters 480 pgs. 32-36 (2000) 
and the like. 

The present invention is not limited to the type of method utilized to detect 
protein-protein interactions and therefore any method known in the art and 
variants thereof can be used. It is however better to use the method described in 
WO 99/42612 or WO 00/66722, both references incorporated herein by 
reference due to the methods" sensitivity, reproducibility and reliability. 
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Protein-protein interactions can also be detected using complementation 
assays such as those described by Pelletier et al at 

http://ww.abrf.org/TBT/Artides/1BT0012/jbtO012.httnL WO 00/07038 and 

W098/34120. 

Although the above methods are described for applications in the yeast 
system, the present invention is not limited to detecting protein-protein 
interactions using yeast, but also includes similar methods that can be used in 
detecting protein protein interactions in, for example, mammalian systems as 
described, for example in Takacs et al., Proc. Nat. Acad. Sci., USA, 90 
(21):10375 (1993) and Vasavada et al., Proc. Nat. Acad. Sci., USA, 88 
(23): 10686-90 (1991), as well as a bacterial two-hybrid system as described in 
Karimov et al (1998), W099/28746, WO 00/66722 and Legrain et al FEBS 
Letters, 480 pgs. 32-36 (2001 ). 

The above-described methods are limited to the use of yeast, mammalian 
cells and Escherichia coli cells, however the present invention is not limited in this 
manner. Consequently, mammalian and typically human cells, as well as 
bacterial, yeast, fungus, insect, nematode and plant cells are encompassed by 
the present invention and may be transfected by the nucleic acid or recombinant 
vector as defined herein. 

Examples of suitable cells include, but are not limited to, VERO cells, 
HELA cells such as ATCC No. CCL2, CHO cell lines such as ATCC No. CCL61, 
COS cells such as COS-7 cells and ATCC No. CRL 1650 cells, W138, BHK, 
HepG2, 3T3 such as ATCC No. CRL6361, A549, PC12, K562 cells, 293 cells, 
Sf9 cells such as ATCC No. CRL171 1 and Cv1 cells such as ATCC No. CCL70. 

Other suitable cells that can be used in the present invention include, but 
are not limited to, prokaryotic host cells strains such as Escherichia coli, (e.g., 
strain DH5-a, Bacillus subtilis, Salmonella typhimurium, or strains of the genera 
of Pseudomonas, Streptomyces and Staphylococcus. 

Further suitable cells that can be used in the present invention include 
yeast cells such as those of Saccharomyces such as Saccharomyces cerevisiae. 
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The bait polynucleotide, as well as the prey polynucleotide can be 
prepared according to the methods known in the art such as those described 
above in the publications and patents reciting the known method perse. 

The bait polynucleotide of the present invention is obtained from genomic 
DNA of Helicobacter pylori. The prey polynucleotide is obtained from genomic 
DNA of Helicobacter pylori, variants of genomic DNA of Helicobacter pylori, and 
fragments from the genome or transcriptome of Helicobacter pylori ranging from 
about 20 to 5000. The prey polynucleotide is then selected, sequenced and 
identified. 

A genomic DNA prey library is prepared from the Helicobacter pylori and 
constructed in the specially designed prey vector pP6 as shown in Figure 10 after 
ligation of suitable linkers such that every genomic DNA insert is fused to a 
nucleotide sequence in the vector that encodes the transcription activation 
domain of a reporter gene. Any transcription activation domain can be used in the 
present invention. Examples include, but are not limited to, Gal4,YP16, B42, His 
and the like. 

Toxic reporter genes, such as CAT R , CYH2, CYH1, URA3, bacterial and 
fungi toxins and the like can be used in reverse two-hybrid systems. 

The polypeptides encoded by the nucleotide inserts of the genomic DNA 
prey library thus prepared are termed "prey polypeptides in the context of the 
presently described selection method of the prey polynucleotides. 

The bait polynucleotide can be inserted in bait plasmid as illustrated in 
Figure 1. The bait polynucleotide insert is fused to a polynucleotide encoding the 
binding domain of, for example, the Gal4 DNA binding domain and the shuttle 
expression vector is used to transform cells. 

As stated above, any cells can be utilized in transforming the bait and prey 
polynucleotides of the present invention including mammalian cells, bacterial 
cells, yeast cells, insect cells and the like. 
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In an embodiment, the present invention identifies protein-protein 
interactions in yeast. In using known methods a prey positive clone is identified 
containing a vector which comprises a nucleic acid insert encoding a prey 
polypeptide which binds to a bait polypeptide of interest. The method in which 
protein-protein interactions are identified comprises the following steps: 

i) mating at least one first haploid recombinant yeast cell clone 
from a recombinant yeast cell clone library that has been 
transformed with a plasmid containing the prey polynucleotide 
to be assayed with a second haploid recombinant yeast cell 
clone transformed with a plasmid containing a bait 
polynucleotide encoding for the bait polypeptide; 

ii) cultivating diploid cell' clones obtained in step i) on a selective 
medium; and 

iii) selecting recombinant cell clones which grow on the selective 
medium. 

This method may further comprise the step of: 

iv) characterizing the prey polynucleotide contained in each 
recombinant cell clone which is selected in step iii). 

In yet another embodiment of the present invention, in lieu of yeast, 
Escherichia coli is used in a bacterial two-hybrid system, which encompasses a 
similar principle to that described above for yeast, but does not involve mating for 
characterizing the prey polynucleotide. 

In yet another embodiment of the present invention, mammalian cells and 
a method similar to that described above for yeast for characterizing the prey 
polynucleotide are used. 

By performing the yeast, bacterial or mammalian two-hybrid system it is 
possible to identify for one particular bait an interacting prey polypeptide. The 
prey polypeptide that has been selected by testing the library of preys in a screen 
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using the two-hybrid, two plus one hybrid methods and the like, encodes the 
polypeptide interacting with the protein of interest. 

The present invention is also directed, in a general aspect, to a complex of 
polypeptides, polynucleotides encoding the polypeptides composed of a bait 
5 polypeptide or bait polynucleotide encoding the bait polypeptide and a prey 
polypeptide or prey polynucleotide encoding the prey polypeptide capable of 
interacting with the bait polypeptide of interest. These complexes are identified in 
Table 1, as the bait amino acid sequences and the prey amino acid sequences, 
as well as the bait and prey nucleic acid sequences. 

io In another aspect, the present invention relates to a complex of 

polynucleotides consisting of a first polynucleotide, or a fragment thereof, 
encoding a prey polypeptide that interacts with a bait polypeptide and a second 
polynucleotide or a fragment thereof. This fragment has at least 20 consecutive 
nucleotides, but can have between 20 and 5,000 consecutive nucleotides, or 

15 between 12 and 10,000 consecutive nucleotides or between 12 and 20,000 
consecutive nucleotides. 

The polypeptides of column 3 encoded by the polynucleotides of column 2 
in Tables 2 and 7 and the polypeptides of column 5 encoded by the 
polynucleotides of column 3 in Table 8 according to the present invention and 

20 the complexes of the two polypeptides encoded by the sets of two 
polynucleotides also form part of the present invention. In yet another 
embodiment, the present invention relates to an isolated complex of at least two 
polypeptides encoded by two polynucleotides wherein said two polypeptides are 
associated in the complex by affinity binding and are depicted in Table 1 and 

25 Table 8. 

In yet another embodiment, the present invention relates to an isolated 
complex comprising at least a polypeptide encoded by an ORF (HP####) of 
column 1 of Table 1 and a polypeptide encoded by an ORF (HP####) of column 
2 of Table 1 and Table 8. The present invention is not limited to these polypeptide 
30 complexes alone but also includes the isolated complex of the two polypeptides 
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in which fragments and/or homologous polypeptides exhibiting at least 95% 
sequence identity, as well as from 96% sequence identity to 99.999% sequence 
identity. 

Also encompassed in another embodiment of the present invention is an 
5 isolated complex in which the SID® polypeptide (see even SEQ ID Nos. from 2 to 
3256 in column 3 of Table 2, even SEQ ID Nos. 6590 to 6594 in Table 7 and 
even SEQ ID Nos. 6596 to 6644 in Table 8.) of the prey polypeptides encoded by 
uneven SEQ ID Nos. 1 to 3255 in column 2 of Table 2, uneven SEQ ID Nos. 
6589 to 6593 in Table 7 and uneven SEQ ID Nos. 6595 to 6643 in Table 8) 
10 forming the isolated complex. 

Besides the isolated complexes described above, nucleic acids coding for 
a Selected Interacting Domain (SID®) polypeptide or a variant thereof or any of 
the nucleic acids set forth in Tables 2, 7 and 8 can be inserted into an expression 
vector which contains the necessary elements for the transcription and translation 
is of the inserted protein-coding sequence. Such transcription elements include a 
regulatory region and a promoter. Thus, the nucleic acid which may encode a 
marker compound of the present invention is operably linked to a promoter in the 
expression vector. The expression vector may also include a replication origin. 

A wide variety of host/expression vector combinations are employed in 
20 expressing the nucleic acids of the present invention. Useful expression vectors 
that can be used include, for example, segments of chromosomal, non- 
chromosomal and synthetic DNA sequences. Suitable vectors include, but are not 
limited to, derivatives of SV40 and pcDNA and known bacterial plasmids such as 
col El, pCR1, pBR322, pMal-C2, pET, pGEX as described by Smith et al (1988), 
25 pMB9 and derivatives thereof, plasmids such as RP4, phage DNAs such as the 
numerous derivatives of phage I such as NM989, as well as other phage DNA 
such as M13 and filamentous single stranded phage DNA; yeast plasmids such 
as the 2 micron plasmid or derivatives of the 2 micron plasmid, as well as 
centomeric and integrative yeast shuttle vectors; vectors useful in eukaryotic cells 
30 such as vectors useful in insect or mammalian cells; vectors derived from 
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combinations of plasmids and phage DNAs, such as plasmids that have been 
modified to employ phage DNA or the expression control sequences and the tike. 

For example in a baculovirus expression system, both non-fusion transfer 
vectors, such as, but not limited to pVL941 (BamHI cloning site Summers, 

5 pVL1393 (BamHI, Sma\, Xba\, EcoRl, Notl, Xmalll, BglU and Pst\ cloning sites; 
Invitrogen) pVL1392 (BgM, Pstl, Not], Xmalll, EcoRl, Xball, Sma\ and BamHI 
cloning sites; Summers and Invitrogen) and pBlueBaclll (BamHI, Sg/ll, Pstl, Nco\ 
and HindWl cloning sites, with blue/white recombinant screening, Invitrogen), and 
fusion transfer vectors such as, but not limited to, pAc700 (BamHI and Kpnl 

10 cloning sites, in which the BamHI recognition site begins with the initiation codon; 
Summers), pAc701 and pAc70-2 (same as pAc700, with different reading 
frames), pAc360 (fiamHI cloning site 36 base pairs downstream of a polyhedrin 
initiation codon; Invitrogen (195)) and pBlueBacHisA, B, C (three different reading 
frames with BamHI, Sg/ll, Pstl, Ncol and HindWl cloning site, an N-terminal 

is peptide for ProBond purification and blue/white recombinant screening of 
plaques; Invitrogen (220) can be used. 

Mammalian expression vectors contemplated for use in the invention 
include vectors with inducible promoters, such as the dihydrofolate reductase 
promoters, any expression vector with a DHFR expression cassette or a 

20 DHFR/methotrexate co- amplification vector such as pED (Pstl, Sail, Sbal, Smal 
and EcoRl cloning sites, with the vector expressing both the cloned gene and 
DHFR; Kaufman, 1991). Alternatively a glutamine synthetase/methionine 
sulfoximine co-amplification vector, such as pEE14 (HindWl, Xball, Smal, Sbal, 
EcoRl and Bell cloning sites in which the vector expresses glutamine synthetase 

25 and the cloned gene; Celltech). A vector that directs episomal expression under 
the control of the Epstein Barr Virus (EBV) or nuclear antigen (EBNA) can be 
used such as pREP4 (BamHI, Sffl, Xhol, Notl, Nhel, HindlW, Nhel, PvuW and 
Kpnl cloning sites, constitutive RSV-LTR promoter, hygromycin selectable 
marker; Invitrogen) pCEP4 (BamHI, Sfil, Xhol, Nod, Nhel, HindWl, Nhel, Pvull 

30 and Kpnl cloning sites, constitutive hCMV immediate early gene promoter, 
hygromycin selectable marker; Invitrogen), pMEP4 (Kpnl, Pvul, Nhel, Hindlll, 
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Not\, Xho\, Sfft, BamHI cloning sites, inducible methallothionein Ha gene 
promoter, hygromycin selectable marker, Invitrogen), pREP8 (BamHI, Xhol, Not\, 
Hind\\\, Nhe\ and Kpnl cloning sites, RSV-LTR promoter, histidinol selectable 
marker; Invitrogen), pREP9 (Kpn\, Nhe\, Hind\\\, Not\, Xhol, Sfft, BamH\ cloning 
5 sites, RSV-LTR promoter, G418 selectable marker; Invitrogen), and pEBVHis 
(RSV-LTR promoter, hygromycin selectable marker, N-terminal peptide purifiable 
via ProBond resin and cleaved by enterokinase; Invitrogen). 

Selectable mammalian expression vectors for use in the invention include, 
but are not limited to, pRc/CMV (HindW, BstXl, Not\, Sba\ and Apa\ cloning sites, 

10 G418 selection, Invitrogen), pRc/RSV (HindW, Spei, BsfXI, Noil, Xbal cloning 
sites, G418 selection, Invitrogen) and the like. Vaccinia virus mammalian 
expression vectors (see, for example Kaufman 1991) that can be used In the 
present invention include, but are not limited to, pSC1 1 (Smal cloning site, TK- 
and p-gal selection), pMJ601 (Sa/I, Smal, Afl\, Nar\, BspMll, BamHI, Apa\, Nhe\, 

15 Sacll, Kpnl and Hindlll cloning sites; TK- and p-gal selection), pTKgptFIS 
(EcoRI, Pst\, Sal\\, Acd, HindW, Sba\, BamHI and Hpa cloning sites, TK or XPRT 
selection) and the like. 

Yeast expression systems that can also be used in the present invention 
include, but are not limited to, the non-fusion pYES2 vector (Xbal, Spfil, Sho\, 
20 Afofl, GsfXI, EcoRI, BsfXI, BamHI, Sad, Kpnl and Hind\\\ cloning sites, 
Invitrogen), the fusion pYESHisA, B, C (Xbal, Sph\, Shol, Wofl, BsfXI, EcoRI, 
BamHI, Sad, Kpnl and Hind\\\ cloning sites, N-terminal peptide purified with 
ProBond resin and cleaved with enterokinase; Invitrogen), pRS vectors and the 
like. 

25 Consequently, mammalian and typically human cells, as well as bacterial, 

yeast, fungi, insect, nematode and plant cells an used in the present invention 
and may be transfected by the nucleic acid or recombinant vector as defined 
herein. 



Examples of suitable cells include, but are not limited to, VERO cells, 
30 HELA cells such as ATCC No. CCL2, CHO cell lines such as ATCC No. CCL61 , 
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COS cells such as COS-7 cells and ATCC No. CRL 1650 cells, W138, BHK, 
HepG2, 3T3 such as ATCC No. CRL6361, A549, PC12, K562 cells, 293 cells, 
Sf9 cells such as ATCC No. CRL171 1 and Cv1 cells such as ATCC No. CCL70. 

Other suitable cells that can be used in the present invention include, but 
5 are not limited to, prokaryotic host cells strains such as Escherichia coli, (e.g., 
strain DH5-oc), Bacillus subtilis, Salmonella typhimurium, or strains of the genera 
of Pseudomonas, Streptomyces and Staphylococcus. 

Further suitable cells that can be used in the present invention include 
yeast cells such as those of Saccharomyces such as Saccharomyces cerevisiae. 

Besides the specific isolated complexes, as described above, the present 
invention relates to and also encompasses SID® polynucleotides. As explained 
above, for each bait polypeptide, several prey polypeptides may be identified by 
comparing and selecting the intersection of every isolated fragment that are 
included in the same polypeptide, as set forth, in Example 5. Thus the SID® 
polynucleotides of the present invention are represented by the nucleic acid 
sequences of uneven SEQ ID Nos. 1 to 3255 in column 2 of Table 2, uneven 
SEQ ID Nos. 6589 to 6593 in Table 7 and uneven SEQ ID Nos. 6595 to 6643 in 
Table 8 encoding the SID® polypeptides of even SEQ ID Nos. 2 to 3256. of Table 
2, the even SEQ ID Nos. 6590 to 6594 in Table 7 and the even SEQ ID Nos. 
6596 to 6644 in Table 8. 

The present invention is not limited to the SID® nucleic acid sequences as 
described in the above paragraph, but also includes fragments of these 
sequences having at least 6 consecutive nucleic acids, between 6 and 5,000 
consecutive nucleic acids and between 6 and 10,000 consecutive nucleic acids 
and between 6 and 20,000 consecutive nucleic acids, as well as variants thereof. 
The fragments or variants of the SID® sequences possess at least the same 
affinity of binding to its protein or polypeptide counterpart, against which it has 
been initially selected. Moreover this variant and/or fragments of the SID® 
sequences alternatively can have between 95% and 99.999% sequence identity 
to its protein or polypeptide counterpart. 
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According to the present invention the variants can be created by known 
mutagenesis techniques either in vitro or in vivo. Such a variant can be created 
such that it has altered binding characteristics with respect to the target protein 
and more specifically that the variant binds the target sequence with either higher 
5 or lower affinity. 

Polynucleotides that are complementary to the above sequences which 
include the polynucleotides of the SID®'s, their fragments, variants and those that 
have specific sequence identity are also included in the present invention. 

The polynucleotide encoding the SID® polypeptide, fragment or variant 
10 thereof can also be inserted into recombinant vectors which are described in 
detail above. 

The present invention also relates to a composition comprising the above- 
mentioned recombinant vectors containing the SID® polypeptides in Tables 2, 7 
and 8, fragments or variants thereof, as well as recombinant . host cells 
15 transformed by the vectors. The recombinant host cells that can be used in the 
present invention were discussed in greater detail above. 

The compositions comprising the recombinant vectors can contain 
physiological acceptable carriers such as diluents, adjuvants, excipients and any 
vehicle in which this composition can be delivered therapeutically and can 
20 include, but are not limited to sterile liquids such as water and oils. 

In yet another embodiment, the present invention relates to a method of 
selecting modulating compounds, as well as the modulating molecules or 
compounds themselves which may be used in a pharmaceutical composition. 
These modulating compounds may act as a cofactor, as an inhibitor, as 
25 antibodies, as tags, as a competitive inhibitor, as an activator or alternatively 
have agonistic or antagonistic activity on the protein-protein interactions. 

The activity of the modulating compound does not necessarily, for 
example, have to be 100% activation or inhibition. Indeed, even partial activation 
or inhibition can be achieved that is of pharmaceutical interest. 
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The modulating compound can be selected according to a method which 
comprises: 

(a) cultivating a recombinant host cell with a modulating compound on a 
selective medium and a reporter gene the expression of which is toxic 
for said recombinant host cell wherein said recombinant host cell is 
transformed with two vectors: 

(i) wherein said first vector comprises a polynucleotide encoding a first 
hybrid polypeptide having a DNA binding domain; 

(ii) wherein said second vector comprises a polynucleotide encoding a 
second hybrid polypeptide having a transcriptional activating 
domain that activates said toxic reporter gene when the first and 
second hybrid polypeptides interact; 

(b) selecting said modulating compound which inhibits or permits the 
growth of said recombinant host cell. 

Thus, the present invention relates to a modulating compound that inhibits 
the protein-protein interactions of a complex of two polypeptides of Table 1 and 
Table 8. The present invention also relates to a modulating compound that 
activates the protein-protein interactions of a complex of two polypeptides of 
Table 1 and Table 8. 

In yet another embodiment, the present invention relates to a method of 
selecting a modulating compound, which modulating compound inhibits the 
interactions of two polypeptides of Table 1. This method comprises: 

(a) cultivating a recombinant host cell with a modulating compound on a 
selective medium and a reporter gene the expression of which is toxic for 
said recombinant host cell wherein said recombinant host cell is 
transformed with two vectors: 
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(i) wherein said first vector comprises a polynucleotide encoding a 
first hybrid polypeptide having a first domain of an enzyme; 

(ii) wherein said second vector comprises a polynucleotide 
encoding a second hybrid polypeptide having an enzymatic 

5 transcriptional activating domain that activates said toxic reporter 

gene when the first and second hybrid polypeptides interact; 

(b) selecting said modulating compound which inhibits or permits the 
growth of said recombinant host cell. 

In the two methods described above any toxic reporter gene can be 
10 utilized including those reporter genes that can be used for negative selection 
including the URA3 gene, the CYH1 gene, the CYH2 gene and the like. 

In yet another embodiment, the present invention provides a kit for 
screening a modulating compound. This kit comprises a recombinant host cell 
which comprises a reporter gene the expression of which is toxic for the 

15 recombinant host cell. The host cell is transformed with two vectors. The first 
vector comprises a polynucleotide encoding a first hybrid polypeptide having a 
DNA binding domain; and a second vector comprises a polynucleotide encoding 
a second hybrid polypeptide having a transcriptional activating domain that 
activates said toxic reporter gene when the first and second hybrid polypeptides 

20 interact. 

In yet another embodiment a kit is provided for screening a modulating 
compound by providing a recombinant host cell, as described in the paragraph 
above, but instead of a DNA binding domain, the first vector comprises a first 
hybrid polypeptide containing a first domain of a protein. "The second vector 
25 comprises a second polypeptide containing a second part of a complementary 
domain of a protein that activates the toxic reporter gene when the first and 
second hybrid polypeptides interact. 

In the selection methods described above, the activating domain can be p42 
Gal 4, YP16 (HSV) and the DNA-binding domain can be derived from Gal4 or Lex 



WO 02/066501 



30 



PCT/EP01/15428 



A. The protein or enzyme can be adenylate cyclase, guanylate cyclase, DHFR 
and the like. 

SID® in Tables 2, 7 and 8 may be used as modulating compounds. 

In yet another embodiment, the present invention relates to a 
5 pharmaceutical composition comprising the modulating compounds for 
preventing or treating ulcers in a human or animal, most preferably in a mammal. 

This pharmaceutical composition comprises a pharmaceutical^ acceptable 
amount of the modulating compound. The pharmaceutically acceptable amount 
can be estimated from cell culture assays. For example, a dose can be 
10 formulated in animal models to achieve a circulating concentration range that 
includes or encompasses a concentration point or range having the desired effect 
in an in vitro system. This information can thus be used to accurately determine 
the doses in other mammals, including humans and animals. 

The therapeutically effective dose refers to that amount of the compound 
15 that results in amelioration of symptoms in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical 
procedures in cell cultures or in experimental animals. For example, the LD50 
(the dose lethal to 50% of the population) as well as the ED50 (the dose 
therapeutically effective in 50% of the population) can be determined using 
20 methods known in the art. The dose ratio between toxic and therapeutic effects is 
the therapeutic index which can be expressed as the ratio between LD50 and 
ED50 compounds that exhibit high therapeutic indexes. 

The data obtained from the cell culture and animal studies can be used in 
formulating a range of dosage of such compounds which lies preferably within a 
25 range of circulating concentrations that include the ED50 with little or no toxicity. 

The pharmaceutical composition can be administered via any route such as 
locally, orally, systemically, intravenously, intramuscularly, mucosally, using a 
patch and can be encapsulated in liposomes, microparticles, microcapsules, and 
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the like. The pharmaceutical composition can be embedded in liposomes or even 
encapsulated. 

Any pharmaceutical^ acceptable carrier or adjuvant can be used in the 
pharmaceutical composition. The modulating compound will be preferably in a 
5 soluble form combined with a pharmaceutically acceptable carrier. The 
techniques for formulating and administering these compounds can be found in 
"Remington's Pharmaceutical Sciences" Mack Publication Co., Easton, PA, 
latest edition. 

The mode of administration optimum dosages and galenic forms can be 
10 determined by the criteria known in the art taken into account the seriousness of 
the general condition of the mammal, the tolerance of the treatment and the side 
effects. 

The present invention also relates to a method of treating or preventing 
ulcers in a human or mammal in need of such treatment. This method comprises 
15 administering to a mammal in need of such treatment a pharmaceutically 
effective amount of a modulating compound which binds to a targeted bacterial 
protein. In a preferred embodiment, the modulating compound is a polynucleotide 
which may be placed under the control of a regulatory sequence which is 
functional in the mammal or human. 

20 In yet another embodiment, the present invention relates to a 

pharmaceutical composition comprising a SID® polypeptide, a fragment or 
variant thereof. The SID® polypeptide, fragment or variant thereof can be used in 
a pharmaceutical composition provided that it is endowed with highly specific 
binding properties to a bait polypeptide of interest. 

25 The original properties of the SID® polypeptide or variants thereof interfere 

with the naturally occurring interaction between a first protein and a second 
protein within the cells of the organism. Thus, the SID® polypeptide binds 
specifically to either the first polypeptide or the second polypeptide. 
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Therefore, the SID® polypeptides of the present invention or variants 
thereof interfere with protein-protein interactions of Helicotacter pylori proteins or 
between Helicobacter pylori proteins and mammal, for example, human proteins. 

Thus, the present invention relates to a pharmaceutical composition 
5 comprising a pharmaceutical!;/ acceptable amount of a SID® polypeptide or 
variant thereof, provided that the variant has the above-mentioned two 
characteristics; i.e., that it is endowed with highly specific binding properties to a 
bait polypeptide of interest and is devoid of biological activity of the naturally 
occurring protein. 

io In yet another embodiment, the present invention relates to a 

pharmaceutical composition comprising a pharmaceutically effective amount of a 
polynucleotide encoding a SID® polypeptide 'or a variant thereof wherein the 
polynucleotide is placed under the control of an appropriate regulatory sequence. 
Appropriate regulatory sequences that are used are polynucleotide sequences 

15 derived from promoter elements and the like. 

Polynucleotides that can be used in the pharmaceutical composition of the 
present invention include the nucleotide sequence: of uneven SEQ ID Nos. 1 to 
3255 in column 2 of Table 2, uneven SEQ ID Nos. 6589 to 6593 in Table 7 and 
uneven SEQ ID Nos. 6595 to 6643 in Table 8. 

20 Besides the SID® polypeptides and polynucleotides, the pharmaceutical 

composition of the present invention can also include a recombinant expression 
vector comprising the polynucleotide encoding the SID® polypeptide, fragment or 
variant thereof. 

The above described pharmaceutical compositions can be administered by 
25 any route such as orally, systemically, intravenously, intramuscularly, 
intradermal^, mucosally, encapsulated, using a patch and the like. Any 
pharmaceutically acceptable carrier or adjuvant can be used in this 
pharmaceutical composition. 
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The SID® polypeptides as active ingredients will be preferably in a soluble 
form combined with a pharmaceutically acceptable carrier. The techniques for 
formulating and administering these compounds can be found in "Remington's 
Pharmaceutical Sciences" supra. 

5 The amount of pharmaceutically acceptable SID® polypeptides can be 

determined as described above for the modulating compounds using cell culture 
and animal models. 

Such compounds can be used in a pharmaceutical composition to treat or 
prevent ulcer. 

io Thus, the present invention also relates to a method -of preventing or 

treating ulcer in a mammal said method comprising the steps of administering to 
a mammal in need of such treatment a pharmaceutically effective amount of: 

(1 ) a SID® polypeptide of even SEQ ID Nos. 2 to 3256 in column 3 of Table 
2, even SEQ ID Nos. 6590 to 6594 in Table 7 and even SEQ ID Nos. 6596 to 

15 6644 in Table 8 or variants thereof which binds to a targeted mammalian or 
typically human protein; or 

(2) or SID® polynucleotide encoding a SID® polypeptide of uneven SEQ ID 
Nos. 1 to 3255 in column 2 of Table 2, uneven SEQ ID Nos. 6589 to 6593 in 
Table 7 and uneven SEQ ID Nos. 6595 to 6643 in Table 8 or variants or 

20 fragments thereof wherein said polynucleotide is placed under the control of a 
regulatory sequence which is functional in said mammal; or 

(3) a recombinant expression vector comprising a polynucleotide encoding a 
SID® polypeptide which binds to a bacterial protein. 

In another embodiment the present invention nucleic acids comprising a 
25 sequence which encodes the SID® proteins of Table 2 Table 7 and Table 8 
and/or functional derivatives thereof are administered to modulate complex of 
Table 1 and Table 8 by way of gene therapy. Any of the methodologies relating to 
gene therapy available within the art may be 
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used in the practice of the present invention such as those described by 
Goldspiel et al Clin. Pharm. 12 pgs. 488-505 (1993). 

Delivery of the therapeutic nucleic acid into a patient may be direct in vivo 
gene therapy (i.e., the patient is directly exposed to the nucleic acid or nucleic 
5 acid containing vector) or indirect ex vivo gene therapy (i.e., cells are first 
transformed with the nucleic acid in vitro and then transplanted into the patient). 

For example for in vivo gene therapy, an expression vector containing the 
nucleic acid is administered in such a manner that it becomes intracellular; i.e., by 
infection using a defective or attenuated retroviral or other viral vectors as 
10 described, for example in U.S. Patent 4,980,286 or by Robbins et al, Pharmacol. 
Ther. , 80 No. 1 pgs. 35-47 (1998). 

The various retroviral vectors that are known in the art are such as those 
described in Miller et al, Meth. Enzymol. 217 pgs. 581-599 (1993) which have 
been modified to delete those retroviral sequences which are not required for 

15 packaging of the viral genome and subsequent integration into host cell DMA. 
Also adenoviral vectors can be used which are advantageous due to their ability 
to infect non-dividing cells and such high-capacity adenoviral vectors are 
described in Kochanek, Human Gene Therapy, 10, pgs. 2451-2459 (1999). 
Chimeric viral vectors that can be used are those described by Reynolds et al, 

20 Molecular Medecine Today, pgs. 25 -31 (1999). Hybrid vectors can also be used 
and are described by Jacoby et al, Gene Therapy, 4, pgs. 1282-1283 (1997). 

Direct injection of naked DNA or through the use of microparticle 
bombardment (e.g., Gene Gun®; Biolistic, Dupont) or by coating it with lipids can 
also be used in gene therapy. Cell-surface receptor s/transfecting agents or 

25 through encapsulation in liposomes, microparticles or microcapsules or by 
administering the nucleic acid in linkage to a peptide which is known to enter the 
nucleus or by administering it in linkage to a ligand predisposed to receptor- 
mediated endocytosis (See, Wu & Wu, J. Biol. Chem., 262 pgs. 4429-4432; ( 
1987)) can be used to target cell types which specifically express the receptors of 

30 interest. 
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In another embodiment a nucleic acid ligand compound may be produced 
in which the ligand comprises a fusogenic viral peptide designed so as to disrupt 
endosomes, thus allowing the nucleic acid to avoid subsequent lysosomal 
degradation. The nucleic acid may be targeted in Vivo for cell specific endocytOsis 
5 and expression by targeting a specific receptor such as that described in 
W092/06180, W093/14188 and WO 93/20221. Alternatively the nucleic acid may 
be introduced intracellular^ and incorporated within the host cell genome for 
expression by homologous recombination. See, Zijlstra et al, Nature, 342, pgs. 
435-428 (1989). 

io In ex vivo gene a gene is transferred into cells in vitro using tissue culture 

and the cells are delivered to the patient by various methods' such as injecting 
subcutaneously, application of the cells into a skin graft and the intravenous 
injection of recombinant blood cells such as hematopoietic stem or progenitor 
cells. 

15 Cells into which a nucleic acid can be introduced for the purposes of gene 

therapy include, for example, epithelial cells, endothelial cells, keratinocytes, 
fibroblasts, muscle cells, hepatocytes and blood cells. The blood cells that can be 
used include, for example, T-lymphocytes, B-lymphocytes, monocytes, 
macrophages, neutrophils, eosinophils, megakaryocytes, granulocytes, 

20 hematopoietic cells or progenitor cells and the like. 

In' yet another embodiment the present invention relates to protein chips or 
protein microarrays. It is well known in the art that microarrays can contain more 
than 10,000 spots of a protein that can be robotically deposited on a surface of a 
glass slide or nylon filter. The proteins attach covalently to the slide surface, yet 

25 retain their ability to interact with other proteins or small molecules in solution. In 
some instances the protein samples can be made to adhere to glass slides by 
coating the slides with an aldehyde-containing reagent that attaches to primary 
amines. A process for creating microarrays is described, for example by 
MacBeath and Schreiber in Science, Volume 289, Number 5485, pgs, 1760-1763 

:,o (2000) or Service, Science, Vol, 289, Number 5485 pg. 1673 (2000). An 
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apparatus for controlling, dispensing and measuring small quantities of fluid is 
described, for example, in U.S. Patent No. 6,1 12,605. 

The present invention also provides a record of protein-protein interactions, 
PIM®'s, SID®'s and any data encompassed in the following Tables. It will be 
5 appreciated that this record can be provided in paper or electronic or digital form. 

In yet another embodiment, the present invention relates to the classification 
of H. pylori within functional categories such as genes essential or non essential 
for viability using the general method described in Figure 18. In this regard, two 
exhaustive libraries of H. pylori ORFs were constructed in E.coli The first library 

10 contained every H. pylori (strain 26695) cloned individually (Library I) while the 
second one (Library II), contained these ORFs disrupted by a transposable 
element. These two ordered libraries are valuable tools for a large project of 
systematic inactivation of every ORF of the H. pylori genome. They were used to 
develop a strategy to search at the genomic scale for genes essential for the 

15 viability of the bacterium grown in vitro. The inactivation strategy was applied to a 
series of 138 ORFs that were selected on two different criteria. Ninety six of them 
were previously shown to encode either proteins involved in protein-protein 
interaction in the two-hybrid yeast assay (Rain et al, 2001), and 42 encode H. 
pylori specific protein with no known function. The screening procedure led to the 

20 identification of 40 Putative Essential Genes (PEGs), of which 15 were shown to 
be true essential genes. The combination of both essentiality as well as the 
identification of interacting domains might serve as a direct pathway for the 
design of active compounds capable of inhibiting protein-protein interactions and 
possibly bacterial growth was the reasoning used behind this analysis. 

25 Construction and validation of the two ordered libraries 

Library I consisted of an individual and ordered {hp0001 to hp1590) bank of 
every putative ORF according to the information provided on the Webb TIGR site 
in 1997. Based on the 1590 ORFs identified at that time, 5' and 3'- 
oligonucleotides were synthesized with the characteristics described in Example 
30 10 and in Table 3. Each ORF was cloned into a plLL570-» derivative plasmid, 3.6 
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kb in length. This derivative plasmid corresponds to plLL570 (Labigne et al, 
1992) in which DNA from the HindlU site of the polylinker to the Aval site (position 
1425 of the pBR322 bone) has been excluded by reverse PCR using pll_L570 as 
a template and 570-1 plus 570-2 as primers (Table 4). Each ORF was cloned in 
such way that the 5'-end of the gene (including the ATG) was inserted 
immediately downstream the three transcriptional and translational stops of 
plLL570 (Labigne et al, 1992) to prevent toxicity of the recombinant proteins in E. 
coli. The library consists of 17 times 96-well plates (plate 1.1 to plate XVII. 1). The 
recombinant plasmids were transformed into DH5a E. coli cells harboring the 
pTCA plasmid, a plasmid that confers resistance to tetracycline, encodes the Tn3 
transposase and is immune to Tn3 (Seifert et al, 1986). The presence of the two 
compatible plasmids plLL570-HP000X plus pTCA was checked by plasmid 
extraction and gel electrophoresis on individual isolated tetracycline, 
spectinomycin, kanamycin resistant clones. In addition, using primers 570-3 and 
570-4 (corresponding to the bounderies of the cloning site on plLL570-») the 
adequation between the size, of the cloned PCR product and that of the 
corresponding ORF was confirmed. Library I consists of all the putative ORFs 
described on the TIGR Webb site in 1997 with the exclusion of 40 ORFs (hp01, 
10, 46, 56, 94, 160, 223, 264, 289, 293, 399, 415, 435, 440, 453, 159, 464, 465, 
488, 547, 607, 722, 790, 814, 846, 876, 884, 898, 968, 1007, 1069, 1205, 1248, 
1304, 1358, 1394, 1452, 1460, 1497, 1511) for which either the initial gene 
amplification or the final cloning failed. 

Library II consists of the random insertion of Tn3-Km into each of the 
recombinant plLL570— /?p000X plasmid. Process of disruption has been 
designed to generate multiple independent transposon insertions for each cloned 
ORF. Tn3-Km was shown to preferentially map into the H. pylori inserts due to 
both the intrinsic properties of Tn3 that transposes into AT rich DNA region and 
the requirement of maintaining intact replicative function and spectinomycin 
modifying enzyme (aadA). The efficiency of the whole procedure was checked 
for five plates. For those 5 plates, the resulting kanamycin transconjugants of 96 
independent cloned ORFs were kept individually and as pools of plasmids. The 
96 disrupted recombinant plasmids were extracted together and used as a 
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template for individual PCR assays using as primers the 38bp-R (Table 4) of the 
inverted repeats of Tn3 transposon and each of the 5'-end specific 
oligonucleotide used for the cloning of the ORFs. Figure 17 illustrates the results 
obtained for the disruption of the ORFs hp0099 to hp0198 (Plate II.2). This figure 
5 exemplifies first that multiple insertions of the transposon took place, second that 
for the majority trransposon insertion occurred at a distance ranging between 100 
to 600 bp from the 5'-end of the ORF, a distance compatible with the promotion of 
gene replacement by allelic recombination. 

Screenin g of Putative Essential Genes of H. pylori within a series of 138 
10 selected ORFs 

The ordered library of disrupted H. pylori ORFs in E. coli was used for the 
genomic screening of putative essential genes (PEGs), and the screening of a 
subset of 96 individual selected ORFs of H. pylori strain 26695 (Table 5) as 
genes encoding proteins demonstrating homodimeric or heterodimeric protein- 

15 protein interactions [Rain, 2001 #14], and the screening of a subset of 42 ORFs 
encoding H. pylori specific predicted protein with no known function (Table 6). 
Included in the set of 138 ORFs were ORFs used as controls: ORFs known to be 
essential [ groES (hp0011); holB (hp1231), dnaA (hp1529], or ORFs known to be 
non-essential for viability in vitro: urel, {hp0071); rdxA {hp0954); ggt (hp1118) 

20 representive of various sizes. For the 138 ORFs, individual Tn3-Km disrupted 
recombinant plasmids were extracted from Library II, and used to transform H. 
pylori strain HAS141 (Janvier et al, 1999). This isolate was chosen for the initial 
screening due to its high natural transformation competency, two logs above that 
of the sequenced strain 26695, and its ability to colonize mouse stomach (Janvier 

25 et al, 1999). Kanamycin transformants were obtained for all but the hspA gene 
as expected, and 40 of the 138 tested ORFs, namely hp0061, 175, 377, 419, 
553, 650, 739, 862, 928, 990, 1012, 1014, 1074, 1230, 1245, 1263, 1493 for the 
first series (Table 5), and hp0130, 231, 271, 358, 394, 659, 697, 699, 721, 726, 
746, 838, 935, 947, 953, 973, 1023, 1028, 1039, 1053, 1085, 1265, 1568 (Table 

30 6) which thus can be designated as Putative Essential Genes. Due to the 
presence of the two large terminal inverted repeats (38 bp in size) within Tn3-Km 
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which quickly reanneaied during the amplification procedure, gene amplification 
of the disrupted allele using the respective 5' and 3' oligonucleotides of a given 
ORF often led to the production of a PCR product with a size similar to that of the 
parental allele, and thus was not helpful to confirm gene replacement of the 

5 parental by the disrupted allele. Thus, six individual kanamycin resistant 
transformants obtained for the 78 knock-out (KO) genes were controlled by gene 
amplification using the 5' and the 3' oligonucleotides of the KO-ORF respectively 
in pairs with the 38 bp (Table 4) of the inverted repeat of the transposon. Criteria 
for allelic replacement were that the sum of the size of the two PCR products be 

io identical to that of the KO-ORF. The final identification of the disrupted ORF and 
of the site of insertion was done by sequencing one of the two PCR products. 
For ORFs with a size over 700 bp two or three different transposon insertions 
mapping in the middle of the ORF were commonly observed among the six 
analysed transformants. For a few ORFs, the 5' and the 3' oligonucleotides 

15 initially designed from the sequence of strain 26695, were unable to amplify the 
chromosome of the HAS141 parental strain due to intrinsic polymorphism. In 
these cases, the 5' and 3' oligos from ORFn+1 or ORFn-1 together with the 38 bp 
were used. Unexpectedly, a few kanamycin resistant transformants were 
obtained for hp1231 (holB), and hp1529 {dnaA). For these genes the insertion 

20 transposon mapped at the very 3' end of the genes, in contrast to the other genes 
where the transposable element mapped at any place but at a minimal distance 
of 300 bp of either end of the genes. These observations indicate that for some 
of the genes, known to be essential, the kanamycin resistant gene could be 
rescued, without allelic replacement, by integration of the whole plasmid via a 

25 single crossing-over. To estimate the frequency of this event, 260 preparations of 
chromosomal DNA from individual 210) or pooled (50) kanamycin resistant H. 
pylori transformants were spotted on nitrocellulose, denatured and hybridized 
with a probe consisting of the plLL570- vector. Of 260 DNA preparations tested, 
chromosomal DNA extracted from individual transformants of the disrupted 

30 hp1231(holB), 1514 (unknown) and 1529 {dnaA) as well as some (and not all) of 
the individual clones of hp0224 and hp0822 did hybridize with part of the vector 
confirming that a single crossing-over took place. These results underlined that 
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the 77i3-Km strategy is a powerful approach to be used as a first screen, at a 
genomic scale, for the identification of PEGs. The definitive assignment of a non- 
essential status cannot be done exclusively on the presence of kanamycin 
transformants, but had to be confirmed and tested for absence of vector DNA 
5 within the chromosome of transformants by hybridization. 

Essentiality and functional analysis of 15 of the 40 ORFs identified as 
Putative Essential Genes 

According to an in-house definition, PEGs consist of ORFs that did not lead 
to the isolation of kanamycin transformants following the transformation of the 

io parental isolate HAS141 with a pool of disrupted ORF. However, absence of 
kanamycin transformants cannot be directly associated with the identification of a 
true essential gene. Several explanations might account for the lack of knock out 
kanamycin resistant mutant, such as (i) absence of the specific gene in the tested 
strain, (ii) polar effect of the transposon on an essential ORF located downstream 

is of the PEG, a property associated with miniTn3-Km (Skouloubris et al, 1998) (iii) 
experimental failure due to the small size of the ORF and of the bordering 
sequences required for allelic exchange, thus, to confirm the essential character 
of a PEG (i.e., essentiality) additional experiments were achieved. For each of 
the 40 PEGs identified through the initial screening in strain HAS141 (Tables 5 & 

20 6), the following criteria were used to confirm their essentiality. First, the 
distribution of the different ORFs of the genome by DNA/DNA hybridizations 
within clinical isolates to confirm their presence not only in the transformed isolate 
(HAS141), but also in every clinical isolate tested was studied. Second, a non- 
polar mutation was introduced into the PEG cloned in E. coli following the 

25 approach depicted in the Materials and Methods section, performed by reverse 
PCR from plasmids of Library I. The recombinant plasmids consist, in each case, 
of the 0.9 kb-Kanamycin promoter-less cassette flanked respectively upstream 
and downstream with the 300 first and the 300 last nucleotides of the ORF to 
knock-out. In addition, the cassette carries a ribosome binding site and a start 

?o codon (ATG) in phase with the hundred 3'-terminal codons of the KO gene 
allowing the translation of the end of the gene to overcome any 
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transcriptional/translational coupling effect. The resulting constructed plasmid 
was transformed in four different H. pylori genetic backgrounds: strains HAS141, 
N6, X47-2An and the sequenced strain 26695. Taking into account these criteria 
(Figure 17), the following conclusions were drawn relative to the 40 initially 
5 identified PEGs. 

Strain HAS141 lacks ORFs h P 0990 and hp1074. These absences accounted 
for the lack of mutant for these two ORFs, whereas the other 38 PEGs were 
found to be present in each of the 37 tested clinical isolates (Salama et al, 2000) 
and manuscript in preparation). The fact that no HAS141 clones resistant to 

io kanamycin were obtained when transformed with the derivative plasmids 
containing the disrupted hp0990 and hp1074 ORFs confirmed that in the absence 
of portable homologous region integration of the plasmid does not occur. These 
two ORFs were not studied further since a non ubiquitous ORF has low 
probability to be essential and even if so, would not be a good candidate for H. 

5 pylori specific therapeutical drug design. 

Non polar kanamycin mutants were unambiguously obtained in HAS141, N6, 
X47-2An as well as 26695 for ORFs hp0061, 419, 553, 650, 1263, and 1493 of 
the first series of genes encoding interacting protein (Table 5), and hp0130, 271, 
358, 697, 699, 721, 726, 746, 838, 935, 947, 953, 973, 1023, 1028, 1039, 1053 
(Table 6). Gene replacement of the parental allele by the deleted and disrupted 
allele was confirmed by testing the chromosome of the mutant for the 
disappearance of a PCR product with a size identical to that of the parental ORF 
and its replacement by the expected size (1.5 kb PCR product: 300 + 900+ 300 
bp). When by chance, the parental ORF had a size ranging around 1.5 kb, gene 
replacement was confirmed by restricting the 1.5 kb PCR product with Sma\ to 
release the 0.9 kb-Km non-polar cassette. 

For 15 of the 38 ubiquitous PEGs, we were unable to obtain kanamycin 
transformants when introducing a non polar mutation in N6, X47-2An, 26695, 
although the genes were found to be present in every isolates (ubiquitous). One 
or two very rare clones were obtained for some of the genes exclusively in 
HAS141. All these kanamycin resistant HAS141 transformants were positive 
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when tested by hybridization with the labeled plLL570— vector used as a probe, 
again attesting for the presence of rare but possible single crossing-over seen 
under strong selective pressure. These 15 genes {hp0175, hp0231, hp0377, 
h P 0394, hp0659, hp0739, hp0862, hp0928, hp1012, hp1014, hp1085, hp1230, 
hp1245, hp1265, hp1568) can thus be definitively recognized as genes essential 
for the viability of H. pylori in vitro. Among these 15 genes, 9 are known to be 
genes that encode proteins involved in protein-protein interactions, and 6 were 
selected as encoding H. pylori specific protein without known function. They 
encode proteins with properties that will be discussed and classified in regards to 
their potential as putative therapeutical targets. 

Search for interacting proteins when using the 6 ubiquitous H. pylori 
specific essential genes as a bait in the two hybrid assay 

The h P 0231, hp0394, hp0659, hp1085, hp1265, and hp1568 ORFs were 
cloned in the pB6vector and used as individual bait for the identification of 
interacting proteins (Table 7). 

Proteomic screens conducted for HP0231, HP1085, HP1568 did not provide 
data allowing assignation of a putative function and did not reveal homodimeric 
interaction underline the usefulness of the protein as a possible therapeutic 
target. Those genes remain ubiquitous, essential, H. pylori specific and without 
known function. In contrast, HP0394, HP0659 and HP1265 gave positive 
screens (Table 7). 

Classi fication of twelve of the 15 ubiquitous essential genes. 

The combination of both essentiality as well as the identification of 
interacting domains might serve as a direct pathway for the design of active 
compounds capable of inhibiting protein-protein interactions and possibly 
bacterial growth. Of the 15 ubiquitous essential. genes identified by the procedure, 
12 were shown to be involved in protein-protein interaction and could be 
classified in different categories with regards to their potential as putative 
therapeutic targets. 
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The first category consists of ubiquitous ORFs encoding proteins with 
heterodimeric protein-protein interactions in the two-hybrid assay where both 
partners are playing an essential role for the viability of H. pylori and at least one 
of the two partners is H. pylori specific. Four of the twelve essential ORFs answer 

5 these criteria hp0394, hp0862, hp1230, and hp0659 which encode proteins with 
no known or putative function. Although initially annotated as H. pylori specific, 
the recent publication of the Campylobacter genome demonstrates for some of 
them the existence of homologues in this closely phylogenetically related 
bacteria, but no homologues have been identified in the other bacterial genomes 

io so far sequenced. 

The hp1230 gene encodes a protein that has been recognized via the two 
hybrid assay as an homodimeric protein which interacts with the predicted 
chromosomal replication initiator protein, DnaA, encoded by hp1529. The 
proteomic screen allowed the identification of a specific domain of interaction 

15 (SID) lying between AA31 and AA180 of HP1230 (SID1230) and a SID of 87 AA 
within the N-terminal domain of HP1529 or DnaA (AA12 to AA99) (SID1529) 
(Rain et a/., 2001). To examine whether the HP1230/HP1529 interaction was 
specific and could serve as a target for screening of lead compounds with 
bactericidal activity, the oligonucleotide encoding SID1529 was randomly 

20 mutagenized, and selected, through the two-hybrid system, mutated sid that 
abolished the specific HP1230/HP1529 interaction. This allowed the 
demonstration of the fact that isoleucine 58 and Lysine 61 were involved in the 
HP1230/HP1529 interaction since a double mutant I58F/K61I within sid1529 
abolished this interaction. This mutagenesis random procedure also led to the 

25 isolation of a mutated SID1529* (V53L) which confers a superbinder phenotype 
to HP1529 in the two-hybrid system. When overexpressed in the three-hybrid 
system (Tirode et al., 1997) under the control of the regulated Met25 promoter in 
vector p3H1 (Colland et al., 2001) (Figure 3), the superbinder SID1529* 
completely inhibits the HP1230/HP1529 interaction. Thus, oligopeptide - 

?o PNQLLCTTITAKYG-( SEQ ID No. 6588) or overlapping or combinatory 
derivatives have some potential as lead compounds to inhibit H. pylori growth. 
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Another example of this category of interest is the hp0862 gene. The two- 
hybrid screen procedure revealed interactions between the HP0862 gene product 
and the C-terminal domain (AA100 to AA191) of the thymidylate kinase 
(HP1474), an essential enzyme responsible for the first phosphorylation step in 

5 the conversion of deoxythymidine 5'-monophosphate to deoxythymidine 5'- 
diphosphate for the final production of dTTP. As for HP1230, the actual function 
of the HP0862 encoding gene is unknown, but its essential character and 
interaction with a known essential enzyme (Tmk) might orient further functional 
analysis, and encouraged the definition of more precise domain of interactions 

io between the TmK protein and this new specific interacting protein. 

The hp394 and hp659 might also enter this category. They both encode 
protein essential specific to H. pylori. The hp394 encodes a protein whose C- 
terminal domain (last 76 AA) interacts with the ft subunit of the acetylcoenzyme A 
carboxylase transferase (HP950). This enzyme has an essential function in 

is membrane lipid synthesis and catalyses the formation of malonyl-CoA, the first 
intermediate for fatty acid synthesis . The protein encoded by hp0659 interacts 
with a putative outer membrane protein HP0655 with no known function. The 
essential property of hp0655 has not yet been tested and thus the classification of 
the hp0659 gene in this first category has to be confirmed. 

20 

- The second category includes genes essential for H. pylori encoding predicted 
proteins with known functions. The characteristic of this category is that these 
genes essential in H. pylori have not been reported to be lethal in other 
organisms. Thus targeting the proteins encoded by these genes might be 

25 relevant of a selective drug design specifically directed against H. pylori. Three of 
the twelve essential genes displayed this characteristic: hp0377 and hp0175, they 
both encode periplasmic proteins. Hp0377 encodes a protein partially homologue 
to DsbC a thiol-disulfide interchange periplasmic protein involved in disulfide 
bond formation (Zapun ef a/., 1995). So far, no other genes encoding Dsb-like 

?o proteins have been identified in the genome of H. pylori whereas in E. coli, as 
many as five proteins are required for disulfide bond formation in the periplasm. 
In E. coli, dsbC encodes a stable homodimer with both protein disulfide 
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isomerase and chaperone activities (McCarthy et a/., 2000), and is not essential 
probably due to redundancy of the function. In contrast, in H. pylori we 
demonstrate the essential character of that gene. The HP0377 product interacts 
in the two-hybrid assay with the last 100 C-terminal amino acids (SID) of the 
homodimeric secreted cysteine containing protein encoded by hp0224. The 
HP0224 product is a methionine sulfoxide reductase homologue (MsrA). In E. 
coli, MsrA plays a role in response to oxidative damage by reducing the 
methionine sulfoxide residues (Moskovitz et a/., 1995), and directly, or indirectly 
contributes to the maintenance of adhesins (Wizemann et a/., 1996). The H. 
pylori MsrA homologue (HP0224) was identified as one of the major antigens 
released into the extracellular space (Cao et a/., 1998). Thus, the HP0377 
encoding DsbC is a good H. pylori specific target candidate because of the 
essential character unique to the bacterial species, and the accessibility of the 
protein within the periplasmic space. 

The hp0175 gene is another representative of this category. The gene 
encodes a predicted peptidyl-propyl cis-trans isomerase, an enzyme that 
accelerates protein, folding by catalyzing the cis-trans isomerization of proline 
imidic peptide bonds in oligopeptides. Two-hybrid screening identified HP0608, a 
H. pylori specific protein of unknown function, as interacting partner with HP0175. 
The essential character of the gene, documented in this study, as for HP0377, 
has not been reported for other microorganisms and appears unique to the 
species. 

Finally, hp1265 is a very unique and specific target for H. pylori. This 
essential gene is part of a large cluster of 14 genes {hp1260 to hp1273) among 
which 12 encode homologues of 12 of the 14 subunits of the NADH 
oxidoreductase complex of E. coli. In E. coli inactivation of the genes encoding 
the different subunits (nuo) is not lethal for the bacterium. In H. pylori inactivation 
of hp1263 is not lethal, but inactivation of one of the two H. pylori specific subunit 
encoding gene is lethal, indicating that the HP1265 subunit which should be the 
subunit involved in the NADH binding has very unique properties deserving 
further functional investigations. 
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- A third category, consists in ORFs that encoded conserved hypothetical 
proteins, distributed in almost all the sequenced bacterial genomes, for which no 
function has been assigned, nor the essentiality assessed. We report here on the 
essential character of two of them the hp0739 gene which encodes a protein that 

5 interacts with another conserved hypothetical but non essential protein (HP810), 
and hp1012. In the two sequenced H. pylori genomes (26695 and J99), the 
hp0739 gene is flanked by two genes involved in the biosynthesis of the 
peptidoglycan, and its involvement in this biosynthetic pathway remains to be 
explored. The hp1012 gene encoded a protein, which has similitude with some 

o metallo-proteases, however the function of this specific protease is unknown. 
These two proteins with large distribution spectrum, so far unknown as essential 
genes, represent new putative targets. 

- Finally, a fourth category, includes ORFs with known functions that were 
previously shown to be essential in other microorganisms. The present work 
allows us to extend this property to the H. pylori species, and reinforces their 
value as putative targets with large spectrum. The hp0928 gene is one of those; 
it was selected through the two-hybrid screen as an homodimer. By similarity, 
the assigned function was that of GTP cyclohydrolase (folE) involved in the first 
step of the biosynthetic pathway of tetrahydrofolate, the structure of which was 
shown to be a homodecameric complex form of two pentamers. Both, the 
inability to succeed in knocking out that gene, and its oligomeric structure 
supports the assigned function. The two-hybrid screening procedure delineates a 
domain of interaction consisting of 133 amino-acids between the FolE subunits. 
This domain might serve as a therapeutical target for the screening of lead 
compounds with large bacterial spectrum. The hp1014 gene (hdhA) encodes an 
NAD + -dependent oxidoreductase belonging to the short-chain 
dehydrogenase/reductase (SDR 1 family). The enzyme is known to require a 
tetrameric form to be active in E. coli (Yoshimoto et a/., 1991) which was 
compatible with the homodimeric interaction observed in the two hybrid assay. 
The major interest of these two ORFs resides in their well known enzymatic 
activity, which provides a direct way to screen lead compounds capable of 
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abolishing the enzymatic activity through the disruption of the oligomeric 
interactions. The present work also classifies hp1245 as an essential gene of this 
category. This gene encodes the SSB protein, the single strand binding protein, 
involved in DNA replication, recombination and DNA repair, and such observation 
is confirmatory of work previously done in other model microorganisms. HP1245 
was found to interact with HP0650, a non-essential protein of unknown function, 
but also significantly with HP0661, a predicted ribonuclease H involved in DNA 
replication, a finding consistent with the HP1245 function. 

In order to fully illustrate the present invention and advantages thereof, the 
following specific examples are given, it being understood that the same are 
intended only as illustrative and in nowise limitative. 

EXAMPLES 

Example 1: Preparation of a Helicobacter pylori genomic collection 

1.A. Fragment collection preparation arid transformation in E. coli 
1.B. Collection transformation in Saccharomyces cerevisiae 
1 .C. Construction of bait piasmid 
Example 2: Screening the collection with the two-hybrid in yeast system 
2.A. The mating protocol 
2.B. The X-Gal overlay assay 

2. C. The luminometry assay 
Example 3 : Identification of positive clones 

3. A. PCR on yeast colonies 

3.B. Piasmid rescue from yeast by electroporation 
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Example 4 : Detection of protein-protein interaction 

Example 5 : Identification of SID® 

Example 6 : Screening of modulating agent 

Example 7: Gene therapy Example using SID® polypeptides 

5 Example 8: Making of polyclonal and monoclonal antibodies 

Example 9 : Classification of genes of H. pylori 

Example 10 : Study of the interaction between two essential 
genes, HP1230-HP1529, by random mutagenesis. 

Medium composition and standard protocols are available in 
o Maniatis etai. 

Example 1: Preparation of a Helicobacter pylori genomic collection 

1.A. Collection preparation and transformation in Escherichia coli 
1.A.1. Fragment of genomic DNA preparatio n 

The Helicobacter pylori genomic DNA is fragmented in a nebulizer (GATC) 
for 1 minute, precipitated and resuspended in water. 

The obtained nebulized genomic DNA is successively treated with Mung 
Bean Nuclease (Biolabs) (30 minutes at 30°C), T4 DNA polymerase (Biolabs) (10 
minutes at 37°C) and Klenow enzyme (Pharmacia) (10 minutes at room 
temperature and 1 hour at 16°C). 

DNA is then extracted, precipitated and resuspended in water. 

1.A.2. Ligation of linkers to blunt-ended genomic DNA 

Oligonucleotide PL160 (5' end phosphorylated) 1 ug /pi and PL159 
2pg/yl. 
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Sequence of the oligo PL160 : 5'-ATCCCGGACGAAGGCC-3' (SEQ ID NO. 3257) 

Sequence of the oligo PL1 59 : 5-GGCCTTCGTCCGG-3' (SEQ ID NO. 3258) 

Linkers were preincubated (5 minutes at 95°C, 10 minutes at 68°C, 15 
minutes at 42°C) then cooled down at room temperature and ligated with 
5 genomic DNA inserts at 4°C overnight. 

Linkers were further removed on a separation column (Chromaspin TE 400, 
Clontech), according to the manufacturers protocol. 

1.A.3. Vector preparation 

pACTIIst is successively digest with BamHI restriction enzyme (Biolabs) for 
io 1 hour at 37°C, dephosphorylated with Calf Intestine Phosphatase (CIP) (Biolabs) 
and filled in with dGTP using Vent DNA polymerase (exo-) (Biolabs), extracted, 
precipitated and resuspended in water. 

1.A.4. Liga tion between vector and insert ofpernomic DNA 

The prepared vector is ligated overnight at 15°C with the genomic blunt 
15 ended DNA described in section 2 using T4 DNA ligase (Biolabs). The DNA is 
then precipitated and resuspended in water. 

1.A.5. Library transformation in Escherichia coli 

Transform DNA from section 1.A.4 into Electromax DH10B 
electrocompetent cells (Gibco BRL) with a Cell Porator apparatus (Gibe BRL). 
20 Add 1 ml SOC medium and incubate transformed cells at 37°C for 1 hour. Add 9 
ml volume of SOC medium per tube and plate on LB+ampicillin medium. Scrape 
colonies with liquid LB medium. Aliquot and freeze at -80°C. 

The obtained collection of recombinant cell clones was named HGXBHP1 
(CNCM N° 1-2181). 



1.B. Collection transfor mation in Saccharomvces cerevisiae 
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The Saccharamyces cerevisiae strain (Y187 (MATa GalA Gal80A ade2- 
101 His3 Leu2-3, -112 Trp1-901 Ura3-52 URA3::UASGALl-LacZ Met)) 
transformed with the HGXBHP1 H. pylori genomic DNA library. 

The plasmid DNA contained in E, coli were extracted (Qiagen) from 
aliquoted E. coli frozen ceils (1.A.5.). 

Grow Saccharomyces cerevisiae yeast Y187 in YPGIu. 

Yeast transformation is performed according to standard protocol (Giest et 
al. Yeast, 11, 355-360, 1995) using yeast carrier DNA (Clontech). This 
experiment leads to 10 4 to 5 10 4 cells/pg DNA. Spread 2 10 4 cells on DO-Leu 
medium per plates. Aliquot and freeze at -80°C. 

1. C. Construction of bait plasmid 

The genomic amplification of the ORF is obtained by PCR using the Pfu 
proofreading Taq polymerase (Stratagene) and 200 ng of genomic DNA as the 
template. PCR primers are chosen in regions flanking the ORF. 

Set up the PCR program as followed: 



72° 6' | 

72° 10' 
15° x 

Check amplification on agarose gel. 

Purify PCR fragments with Qiaquick column (Qiagen) according to the 
manufacturer's protocol. 

Digest purified PCR fragments with adequate restriction enzymes. 

Purify PCR fragments with Qiaquick column (Qiagen) according to the 
manufacturer's protocol. 




x 30 cycles 
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Ligate digested PCR fragments into an adequately digested and 
dephosphorylated bait vector (pAS2AA) according to standard protocol (Maniatis 
et al.). 

Transform into competent bacterial cells. Grow cells, extract DNA and sequence 
5 plasmid. 

Example 2: Screening the collection with the two-hybrid in yeast system 

2.A. The mating protocol 

We have chosen the mating two-hybrid in yeast system (first described by 
Legrain et al., Nature Genetics, 1997, vol. 16, 277-282, Toward a functional 
io analysis of the yeast genome through exhaustive two-hybrid screens) for its 
advantages but we could also screen the Helicobacter pylori collection in the 
classical two-hybrid system as described in Fields et al, or in a yeast reverse two- 
hybrid system. 

The mating procedure allows a direct selection on selective plates 
is because the two fusion proteins are already produced in the parental cells. No 
replica plating is required. 

This protocol is written for the use of the library transformed into the Y187 
strain. Before mating, transform S. cerevisiae (CG 1945 strain (MATa Gal4-542 
Gall 80-538 ade2-101 His3*200 Leu2-3,-112 Trp1-901 Ura3-52 Lys2-801 
20 URA3::GAL4 17mers (X3)-CyC1TATA-LacZ LYS2::GAL1 UAS-GAL1TATA-HIS3 
CYH R )) according to step 1.B and spread on DO-Trp medium. 

Day 1, morning : preculture 

Preculture of Y187 cells carrying the bait plasmid obtained at step 1.C. in 20 ml 
DO-Trp medium. Grow at 30°C with vigorous agitation. 
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Day 1, late afternoon : culture 

Measure OD S oonm of the DO-Trp preculture of Y187 cells carrying the bait 
plasmid preculture. The OD 6 oonm must lie between 0.1 and 0.5 in order to 
correspond to a linear measurement. 

5 Inoculate 150 ml DO-Trp at OD S00 nm 0.006/ml, grow overnight at 30°C with 
vigorous agitation. 

Day 2 ; mating 

Medium and plates 

5 YPGIu plates 
10 50 ml tube with 30 ml DO-Leu-Trp-His 

100 ml flask with 20 ml of YPGIu 

75 DO-Leu-Trp-His plates 

2 DO-Leu plates 2 DO-Trp plates 

2 DO-Leu-Trp plates 

15 Measure OD 60 onm of the DO-Trp culture. It should be around 1 . 

For the mating, you must use twice as many bait cells as library cells. To 
get a good mating efficiency, you must collect the cells at 10 8 cells per cm 2 . 

Estimate the amount of bait culture (in ml) that makes up 80 OD 6 oonm units 
for the mating with the prokaryote library. 

20 Thaw a vial containing the HGXYHP1 library slowly on ice. Add the 

contents of the vial to 20 ml YPGIu. Let those cells recover at 30°C, under gentle 
agitation for 10 minutes. 

Mating 



Put the 80 OD 600 nm units of bait culture into a 250 ml flask. 
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Add the HGXYHP1 library culture to the bait culture. Transfer the mixture 
of diploids into 50 ml sterile tubes. Centrifuge, discard the supernatant and 
resuspend in YPGIu medium. 

Distribute cells in 400 \i\ samples in YPGIu plates with glass beads. 
5 Spread cells by shaking the plates. 

Incubate plates cells-up at 30°C for 4h30min. 

Collection of mated cells 

Wash and rinse plates and spread collected cells on DO-Leu-Trp-His+Tet 
plates. 

10 Day 4 

Selection of clones able to grow on DO-Leu-Trp-His+-Tetracyclin : this medium 
allows us to isolate diploid clones presenting an interaction. 

Count the His+ colonies on control plates. 

The number of His+ cell clones will define which protocol is to be processed: 

15 Upon 20 X10 6 His+ colonies 

- if the number of His+ cell clones > 285 : then process overlay and then 
luminometry protocols on blue colonies (2.B and 2.C). 

- If the number of His+ cell clones < 285 : process luminometry 
protocol (2.C). 

20 - The following step leads to the selection of the strongest interaction. 

2.8. The X-Gal overlay assay 

The X-Gal overlay assay is performed directly on the selective medium plates 
after scoring the number of His+ colonies. 

Material 
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Set up a waterbath. The water temperature should be 50°C. 

• 0.5 M Na 2 HP0 4 pH 7.5. 

• 1 .2% Bacto-agar. 
. 2% X-Gal in DMF. 

5 • Overlay mixture : 0.25 M Na 2 HP0 4 pH7.5, 0.5% agar, 0.1% SDS, 7% DMF 
(LABOSI), 0,04% X-Gal (ICN). For each plate, 10 ml overlay mixture are needed. 

• DO-Leu-Trp-His plates. 

• Sterile toothpicks. 
Experiment 

io Temperature of the overlay mix should be between 45 and 50°C. 
Pour the overlay-mix over the plates in portions of 10 ml. 
Collect them when the top layer is settled. 
Incubate plates overlay-up at 30°C. Note the time. 

Check for blue colonies regularly. If no blue colony appears, wait for overnight 
15 incubation. Mark with a pen and number the positives. 

Streak the positives colonies on fresh DO-Leu-Trp-His plates with a sterile 
toothpick. 

2. C. The luminometry assay 

Grow His+ colonies overnight at 30°C in microtiter plates containing DO- 
20 Leu-Trp-His+Tetracyclin medium with shaking. The day after, dilute 15 times 
overnight culture into a new microtiter plate containing the same medium. 
Incubate 5 hours at 30X with shaking. Dilute samples 5 times and read 
ODeoonm. Dilute again to obtain between 10,000 and 75,000 yeast cells/well in 
100 ul final volume. 
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Per well, add 76 ul of One Step Yeast Lysis Buffer (Tropix), '20 ui Sapphirell 
Enhancer (Tropix), 4 ui Galacton Star (Tropix), incubate 40 minutes at 30°C. 
Measure the B-Gal read-out (L) using a Luminometer (Trilux, Wallach). 

Calculate value of OD 60 o n mxL and select interacting preys having the highest 
values. 

At this step of the protocol, we have isolated diploid cell clones presenting 
interactions. The next step is now to identify polypeptides involved in the selected 
interactions. 

Example 3: Identification of positive clones 

3.A. PCR on yeast colonies 

Introduction 

PCR amplification of fragments of plasmid DNA directly on yeast colonies 
is a quick and efficient procedure to identify sequences cloned into this plasmid. It 
is directly derived from a published protocol (Wang H. et al., Analytical 
Biochemestry, 237, 145-146, 1996). However, it is not a standardized protocol : in 
our hands it varies from strain to strain, it is dependent of experimental conditions 
(number of cells, Taq polymerase source, etc). This protocol should be optimized 
to specific local conditions. 

Materials 

- For 1 well, PCR mix composition is: 
32.5 ui water, 

5 ul 10X PCR buffer (Pharmacia), 
1 ui dNTP 10 mM, 

0.5 ui Taq polymerase (5ul /ul) (Pharmacia), 
0.5 ul oligonucleotide ABS1 10 pmole/ui: 
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5-GCGTTTGGAATCACTACAGG-3' (SEQ ID NO. 3259) 
0.5 ui oligonucleotide ABS2 10 pmole/ui: 
5-CACGATGCACGTTGAAGTG-3' (SEQ ID NO. 3260) 
- 1 N NaOH. 
5 Experiment 

Grow positive colonies overnight at 30°C on a 96 well cell culture cluster (Costar), 
containing 150 ui DO-Leu-Trp-His+Tetracyclin with shaking. Resuspend culture 
and transfer immediately 100 ui on a Thermowell 96 (Costar). 

Centrifuge 5 minutes at 4,000 rpm at room temperature. 

10 Remove supernatant. Dispense 5 ui NaOH in each well, shake 1 minute. 

Place the Thermowell in the thermocycler (GeneAmp 9700, Perkin Elmer) 5 
minutes at 99.9°C and then 10 minutes at 4°C. 

In each well, add PCR mix, shake well. 

The PCR program was set up as followed : 



94°C 


3 minutes 


94°C 


30 secondes 


53°C 


1 minute 30 secondes 


72°C 


3 minutes 


72°C 


5 minutes 


15°C 





The quality, the quantity and the length of the PCR fragment was checked 
on an agarose gel. The length of the cloned fragment was the estimated length of 
the PCR fragment minus 300 base pairs that corresponded to the amplified 
flanking plasmid sequences. 

3.B. Plasmids rescue from veast by electroporation 
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Introduction 

The previous protocol of PCR on yeast cell may not be successful, in such 
a case, plasmids from yeast by electroporation can be rescued. This experiment 
allows the recovery of prey plasmids from yeast cells by transformation of E. coli 
5 with a yeast cellular extract. The prey plasmid can then be amplified and the 
cloned fragment can be sequenced. 

Material 

Plasmid rescue 

10 Glass beads 425-600 urn (Sigma) 

Phenol/chloroform (1/1) premixed with isoamyl alcohol (Amresco) 

Extraction buffer: 2% Triton X100, 1 % SDS, 100 mM NaCI, 10 mM TrisHCI pH 
8.0, 1 mM EDTA pH 8.0. 

Mix ethanol/NH 4 Ac : 6 volumes ethanol with 7.5 M NH 4 Acetate, 70% Ethanol and 
15 yeast cells in patches on plates. 

Electroporation 

SOC medium 

M9 medium 

Selective plates: M9-Leu+Ampicillin 
20 2 mm electroporation cuvettes (Eurogentech) 
Experiment 

Plasmid rescue 

Prepare cell patch on DO-Leu-Trp-His with eel! culture of section 2.C. 
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Scrape the cell of each patch into an Eppendorf tube, add 300 ul of glass beads 
in each tube, then, add 200 ul extraction buffer and add 200 ul phenol:chloroform: 
isoamyl alcohol (25:24:1). 

Centrifuge tubes 10 minutes at 15,000 rpm. 

Transfer 180 ul supernatant to a sterile Eppendorf tube and add to each 500 ul 
ethanol/NH 4 Ac, vortex. 

Centrifuge tubes 15 minutes, 15,000 rpm at4°C 
Wash pellet with 200 ul 70% ethanol, remove ethanol and dry pellet. 
Resuspend pellet in 10 ul water. Store extracts at -20°C. 
Electroporation 

Material : Electrocompetent MC1066 cells prepared according to standard 
protocols (Maniatis). 

Add 1 ul of yeast plasmid DNA-extract to pre-chilled Eppendorf tube, and keep on 
ice. 

Mix 1 ul plasmid yeast DNA-extract sample, add 20 ul electrocompetent cells and 
transfer in a cold electroporation cuvette. 

Set the Biorad electroporator on 200 ohms resistance, 25 uF capacity; 2.5 kVolts. 
Place cuvette in the cuvette holder and electroporate. 

Add 1 ml SOC into the cuvette and transfer the cell-mix into a sterile Eppendorf 
tube. Let cells recover for 30 minutes at 37°C, spin the cells down 1 minute, 
4,000x g and pour off supernatant. Keep about 100 ul medium and use it to 
resuspend the cells and spread them on selective plates (e.g., M9-Leu plates). 

Incubate plates for 36 hours at 37°C. 

Grow one colony and extract plasmids. Check presence and size of insert 
through enzymatic digestion and agarose gel. Sequence insert. 
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Example 4: Protein-protein interaction 

For each bait, the previously protocol leads to the identification of prey 
polynucleotide sequences. In order to identify a protein-protein interaction, we 
need to characterize the obtained prey polypeptide sequence regarding the 
5 Helicobacter pylori genome. 

This may be accomplish with a software program names blastwun 
(available on the Internet site of the University of Washington 

htrp: / 'bir)\veb.p:^rcur.h /scqanal ' interfaces 'blasrwu.hrm. this is a development 

version of software for gene and protein identification through similarity searches 
10 of protein and nucleotide sequence databases). 

Blastwun program compares prey polynucleotide insert sequence (rescued 
from prey plasmid) with whole Helicobacter pylori genome (available on NCBI 
web site: imp:/. 'www.ncbi.nlm. nih.gov under GenBank accession number 
AE000511). This comparison leads to prey polynucleotide localizations in the H. 
15 Pylori genome, each localization having a score depending on the homology of 
sequence. For each prey polynucleotide, we consider the localization with the 
highest score and, if the insert sequence is included in and is in phase with an 
Open Reading Frame, we can identify one prey polypeptide interacting with one 
bait polypeptide. 

20 Helicobacter pylori ORF's sequences are available on the World-Wide 

Web site of The Institute for Genomic Research (TIGR) at 

hrrp: .' www.ngr.org/ tdb .'mbd / hpdb - hpdb.hclm . 

This web page allows several requests concerning Helicobacter pylori's 
genome, in particular, its ORF sequence. To get the sequences of specific 
25 ORF's, click on the ewindow named "HP#" and click search. This operation leads 
to a new web page presenting nucleic and peptide sequence of the specific ORF. 

See Table 1 : Protein-protein interactions in Helicobacter pylori. 



Example 5: Identification of SID® 
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Experiment results in step 4. sequences of each prey fragment encoding 
for an interacting prey polypeptide. 

By comparing and selecting the intersection of every isolated fragments 
that are included in the same polypeptide, we define the Selected Interacting 
Domain (SID®) see Figure 15. 

See results in Table 2. 

Example 6: Screening of modulating agent 

Select one specific interaction. 

Transform a permeabilized yeast cell with plasmids containing bait 
polypeptide and prey polypeptide of the specific interaction. 

Plate a top agar containing transformed permeabilized yeast cells on square 
boxes (that already contains agarose gel). 

Apply by spotting the compounds to test on top agar as soon as it 
is solidified. Incubate overnight at 30°C. 

Analyse results : select lead compounds that prevent transformed 
permeabilized yeast cells from growing. 

Example 7: Gene therapy Example using SID® polypeptides 

An expression vector containing the SID® polynucleotide is made in the 
manner described in U.S. Patent 4,980,286. It is then administered to patients to 
treat H. pylori infections. 

Example 8: Making of polyclonal and monoclonal antibodies 

The protein-protein complex of Table 1 was injected into mice and 
polyclonal and monoclonal antibodies were made following the procedure set 
forth in Sambrook et al supra. 

More specifically mice are immunized with an immunogen comprising 
complexes conjugated to keyhole limpet hemocyanin using glutaraldehyde or 
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EDC as is well known in the art. The complexes can also be stabilized by 
crosslinking as described in WO 00/37483. The immunogen is then mixed with an 
adjuvant. Each mouse receives four injections of 10 ug to 100 [.ig of immunogen, 
and after the fourth injection, blood samples are taken from the mice to determine 
5 if the serum contains antibodies to the immunogen. Serum titer is determined by 
ELISA or RIA. Mice with sera indicating the presence of antibody to the 
immunogen are selected for hybridoma production. 

Spleens are removed from immune mice and single-cell suspension is 
prepared (Harlow et al 1988). Cell fusions are performed essentially as described 

10 by Kohler et al. Briefly, P365.3 myeloma cells (ATTC Rockville, Md) or NS-1 
myeloma cells are fused with spleen cells using polyethylene glycol as described 
by Harlow et al. Cells are plated at a density of. 2 x 10 5 cells/well in 96-well tissue 
culture plates. Individual wells are examined for growth and the supernatants of 
wells with growth are tested for the presence of Table 1 complex-specific 

15 antibodies by ELISA or RIA using the Table 1 complex as a target protein. Cells 
in positive wells are expanded and subcloned to establish and confirm 
monoclonality. 

Clones with the desired specificities are expanded and grown as ascites in 
mice or in a hollow fiber system to produce sufficient quantities of antibodies for 
20 characterization and assay development. Antibodies are tested for binding to bait 
polypeptide (from column 1 of Table 1) alone or to prey polypeptide (from column 
2 of Table 1) alone, to determine which are specific for the Table 1 complex as 
opposed to those that bind to the individual proteins. 

Monoclonal antibodies against each of the complexes set forth in Table 1 
25 are prepared in a similar manner by mixing specified proteins together, 
immunizing an animal, fusing spleen cells with myeloma cells and isolating clones 
which produce antibodies specific for he protein complex, but not for individual 
proteins. 



Example 9: Classification of genes of H. pylori within function categories at 
30 the genomic scale using 2 exhaustive libraries in E. coli 
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1 . Bacterial strains, growth and storage conditions. 

Escherichia coli strains DH5a (BRL) HB101 (Boyer and Roulland-Dussoix, 
1969) and NS2114 (Rif) (Seifert et a/., 1986) were used as hosts for plasmid 
cloning and disruption experiments and were grown at 37°C in L-broth (10 g of 

5 tryptone, 5 g of yeast extract and 5 g of NaCI per liter, pH 7.0) or on L-agar plates 
(1.5% agar) at 37°C. Antibiotics were used at the following final concentrations 
(ug/ml) unless indicated in the text: spectinomycin:100 (Upjohn Laboratories, 
Pahs, France), tetracycline: 8 (Sigma Chemicals, Saint-Quentin Fallavier, 
France), kanamycin: 25 (Serva, Frankfurt, Germany), rifampicine:100 ug (Sigma 

10 Chemicals). Independent recombinant E. coli were saved by storing up to 96 
clones individually in 96-well microtitre plates; clones were inoculated into L-broth 
supplemented with 8 jug/ml tetracycline, 100 \ig/m\ spectinomycin and 7% DMSO 
(Sigma) and stored at -80°C. H. pylori strain 26695 (Tomb era/., 1997), HAS141 
(Janvier et a/., 1999), N6 (Ferrero et a/., 1992), X47-2an (GUY et al., 1999) were 

is routinely cultured on 10% horse blood agar medium (Blood Agar Base no. 2; 
Oxoid, Lyon, France) or in Brucella broth supplemented with 10 % Fecal Calf 
Serum (Gibco). Solid and liquid media contained supplements at the following 
final concentrations: 10 ^g vancomycin (Dakota Pharmaceuticals, Creteil, 
France), 2.5 IU polymyxin (Pfizer Laboratories, Orsay, France), 5 jag trimethoprim 

20 (Sigma) and 4 \xg amphotericin B (Bristol-Myers Squibb, Paris, France)/ml. 
Plates were incubated at 37°C under microaerobic conditions in an anaerobic jar 
with a carbon dioxide generator (CampyGen, Oxoid) without catalyst. H. pylori 
that had undergone chromosomal allelic exchange were selected on medium 
supplemented with 25 fig kanamycin. 

Production of amplicons corresponding to each of the 1590 ORFs originally 
identified by TIGR in strain 26695: 1621 pairs of forward and reverse 
oligonucleotides targeting the 1590 ORFs of the genome of strain 26695, as 
assigned on the Webb site in 1997, were designed and synthesized by 
30 Eurogentec (Bel S.A., Seraing, Belgium). Pairs (sense and antisense, Table 3) of 
oligonucleotides were designed in order to allow full length amplification of each 
of the ORFs, with the exception of ORFs with a size over 3 kb that were split into 
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two or three PCR products. Every forward and reverse 21 bp-oligonucleotide 
was tagged respectively at its 5'extremity with a CAUCAUCAU (Seq ID No.3261) 
and a CUACUACUA (SEQ ID No. 3262) sequence. Genomic DNA of Hp strain 
26695, prepared from cesium chloride extraction (Labigne-Roussel et a/., 1988), 
5 was used as a template for PCR. Seventeen times 96 amplicons were produced 
by polymerase chain reactions (PCR) using a PCR-express thermal cycler 
through 40 cycles consisting of a denaturation step of 94°C for 2 mn, a primer 
annealing step of 50°C for 30 sec. and an extension step at 72°C for 2 mn, under 
a 96-well format. The amplicons were controlled for size and quality (single 
10 band) on agarose gel. 

Systematic UDG cloning of the ORFs of H. pylori strain 26695 

The cloning of the 96 amplicons was performed using the ligation-independent 

15 method described by Rashtchian (Rashtchian, 1995). First the linear plLL570-» 
derivative vector was prepared by gene amplification using #570-1 and #570-2 
(Table 4) as primers and the plLL570 plasmid (Labigne ef a/., 1992) as a 
template. Three microliters of individual HP0001 to HP1590 PCR products were 
mixed together with 2 ul of plLL570-« derivative vector (75 ng), 14 ul of 1XPCR 

20 buffer, 1 pi of uracil DNA glycosylase (UDG) in a 2000-ul 96-well disposable 
plate. The plates were' incubated at 37°C for 30 mn allowing the enzymatic 
reaction as well as the hybridization between protruding and complementary 
extension hybridization to occur. Competent DH5a cells (100 pi) harboring the 
pTCA plasmid (Seifert ef a/., 1986) were added to each well, and the 96-well 

25 plate was further incubated for 45 mn on ice. One ml of prewarmed L-broth was 
added to each well, and the plate was then incubated for 90 mn at 37°C. Finally, 
a selective antibiotic cocktail containing spectinomycin, tetracycline was added to 
each well to positively select and enrich in plLL570-» derivative recombinant 
plasmid transformed DH5cc(pTCA) cells; plates were then incubated for another 

30 13 hours at 37°C under agitation. Individual transformant colonies were isolated 
by spotting 10 pi of liquid culture from each well on square agar plates containing 
tetracycline and spectinomycin using a 96-we!l inoculator designed to deliver a 10 
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(il liquid volume; cloning of the PCR product was confirmed by mini-preparation 
recombinant plasmid restricted with Cla\-Ava\. They were stored in DMSO (7%) 
at -80°C under a 96-well format as "library I" consisting of plate 1.1 to plate 
XVII. 1. 

Systematic disruption of the recombinant plasmids by transposon mutagenesis 
in E. coii. Transposon mutagenesis of individual E. coli clones was performed 
using the mini-Tn3-Km transposon as previously described by Jenks et al. (jenks 
et al., in press). All manipulations were performed in a 96-well format and four 

io independent transposon mutageneses were carried out in parallel so as to 
saturate the mutagenesis disrupting process with independent events. Briefly, 
the stored microtitre plates containing the individual E. coli DH5a clones that 
harbored the pTCA plus the recombinant plLL570-« derivative plasmids were 
thawed and used to inoculate fresh plates. Plasmid plLL553 harboring the mini- 

15 Tn3-Km transposon (Seifert ef al., 1986) (Labigne, 1997) (a low copy auto- 
transferable plasmid pOX38 derivative) was transferred into these E. coli DH5a 
clones by conjugation. Transconjugates harboring all three plasmids 
(recombinant plLL570-» derivative, pTCA and plLL553), were selected by 
spotting 10 pi of the mating mixture on L-agar containing 25 jug/ml kanamycin, 8 

20 i-ig/ml tetracycline and 100 ng/ml spectinomycin. Cointegrates were transferred 
by conjugation into E. coli NS2114SmRif carrying the ere gene. Positive 
selection' of resolved forms of the cointegrates was obtained by growth on L-agar 
containing 100 |ag/ml rifampicin (Sigma), 625 ug/ml kanamycin, 625 ng/ml 
spectinomycin and 625 ng/mi streptomycin. A pool of each TnKm disrupted 

25 recombinant plasmids was stored as individual stock of disrupted ORFs at -80°C 
in DMSO (7%), designated "library II" consisting of plate 1. 2 to plate XVII. 2. 

DNA preparation and standard molecular biology techniques. The alkaline 
lysis procedure (Sambrook et al., 1989) was used for small-scale preparation and 
jo MIDI Qiagen (Qiagen, Courtaboeuf, France) columns were used for large-scale 
plasmid preparation. Whole cell genomic DNA from individual generated H. pylori 
mutants was extracted using the QIAamp Tissue Kit (Qiagen) according to the 
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manufacturer's instructions. Standard procedures for DNA/DNA hybridization 
were prepared on nitrocellulose membranes (Schleicher and Schuell,) according 
to the procedure of Sambrook et al. (Sambrook et a/., 1989). They were 
hybridized under standard conditions with [a- 32 P]-deoxyribonucleotide probes 
labeled by random priming using the MegaPrime DNA system (Amersham) 
according to the manufacturer's instructions. 

Transformation of H. pylori. H. pylori strains were naturally transformed with 
circular plasmid DNA (- 2 ug per transformation). Briefly, bacteria were 
inoculated as 1 cm patches and grown for 5 h before addition of 10 jil supercoiled 
plasmid DNA. Each disrupted plasmid consisting either of a. pool of disrupted 
plasrnids when originating from library II or of a single recombinant plasmid for 
the non polar mutation construction was added to 4 independently prepared 
patches of H. pylori. After further incubation for 18 h, the bacteria from each 
individual patch were harvested and plated directly onto a single plate of selective 
medium (kanamycin, 25 ug/ml). Six individual kanamycin transformants were 
then subcultured. Chromosomal DNA was extracted using QiAmp kit extraction, 
and the constructed mutant characterized by several PCR controls and/or 
hybridization as described in the result section. 

Introduction of a non polar mutation in a selected ORF (HP000X) 
For a given cloned ORF (HP000X) present in Library I, the recombinant 
plasmid was prepared and used as a template for reverse PCR performed with 
two oligonucleotides. Oligonucleotide HP000X-1 consists of 24 nucleotides 
complementary to an intragenic sequence located 300 bp downstream the 5'-end 
of the gene, and HP000X-2 consists of 24 nucleotides complementary to an 
intragenic sequence located 300 bp upstream of the 3'-end of the gene. All the 
HP00X-1 and HP000X-2 oligos were tagged respectively at their 5'-end with a 
CGGGGTACC (SEQ ID No. 3261) (Kpn\) sequence and at their 3'-end with a 
CGCGGATCC (SEQ ID No. 3262) (BamH\) sequence (Table 4). Following 
reverse amplification with high fidelity Taq polymerase (Boerhinger) from the 
given cloned ORF, 5 pi of PCR product were restricted with Dpnl in order to 
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eliminate the template molecule, then restricted with Kpn\ and BamHI and directly 
ligated to the promoterless non polar Kpn\-BamH\ kanamycin cassette (0.9 kb) 
previously described (Skouloubris ef a/., 1998). Transformants were selected on 
spectinomycin containing plates. Recombinant plasmids were then purified, 
5 controlled, and introduced by natural transformation into H. pylori cells. 

Example 10 - Study of the interaction between two essential genes, 
HP123Q-HP1529, by random mutagenesis. 

1. Preparation of mutagenized SID1529 collection 

10 A. Collection preparation and transformation in Escherichia coli 

i. A Random mutagenesis of SID1529 by PCR 

Mutagenized SID1529 was obtained by PCR using the Tag polymerase 
(Strataqene) and 200 nq of Helicobacter pylori genomic DNA and the following 
oligonucleotides : 

15 5'-ATTTGCGGCCGCAATCTTGGCGCTAGTCAAACAA-3' (SEQ ID No. 3263) 
5'-CCGGGATCCTCAAGATTGGGCGTTAATTTGGAT-3' (SEQ ID No. 3264) 
The PCR program was set up as follows : 



94° r 

45° 1' x 30 cycles 
75° V 
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The PCR conditions were as follows : 

KCI (50 mM), Tris pH 9.0 (10 mM), MgCI 2 (7 mM), MnCI 2 (0.2 mM) 



The amplification was checked by agarose gel electrophoresis. 



B. Digestion of mutagenized SID1529 and pP7-centro vector 



The PCR fragments were purified with Qiaquick column (Qiagen) according to the 
manufacturer's protocol and digested (Not\-BamH\). The vector (pP7-centro) 
(see, Figure 20) was digested (Not\-BamH\) and dephosphorylated according to 
standard protocol (Sambrook eta!.). 



C. Library transformation in Escherichia coli 



The method followed in this section is the same as the one described in 
Example 1. A. 5 above. 



2. Collection transformation in Saccharomyces cerevisiae 



The method followed in this section is the same as the one described in 
Example 1.B above. 



3. Construction of HP1230 bait plasmid 
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The genomic amplification of the HP1230 ORF was obtained by PCR 
using the Pfu proofreading Taq polymerase (Stratagene) and 200 ng of 
Helicobacter pylori genomic DNA as template. 



The PCR program was set up as follows: 

94° 45" 
94° 45" 



x 30 cycles 



48° 45" 
72° 6' 
72° 10' 

The amplification was checked by agarose gel electrophoresis. 

The PCR fragments were purified with Qiaquick column (Qiagen) according to the 
manufacturer's protocol. The digested PCR fragments were ligated into an 
adequately digested (BamHI-Pstl) and dephosphorylated bait vector (pB1) 
according to standard protocol (Sambrook et a/.) and were transformed into 
competent bacterial cells. The cells were grown , the DNA extracted and the 
plasmid was sequenced. ' 

4. Screening the mutagenized SID1529 collection with the HP1230 bait protein 
using two-hybrid in yeast 

The method followed in this section is the same as the one described in 
Example 2 above, with the exception that DO Leu-Trp-His + Tet plates were 
replaced by DOLeu-Trp-His + Tet + 40 mM 3-AT plates. 

5. Identification of positive clones 



The method followed in this section is the same as the one described in 
Example 3 above. 
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6. The SID 1529* (V53L) inhibits the interaction between HP1529 and HP1230 

In the three-hybrid system, the HP1529 protein is expressed fused to the 
GAL4 Activation Domain (AD) in the pP6 plasmid, whereas HP1230 is introduced 
in the p3H1 vector in fusion with the DNA-binding domain (DBD) of GAL4. In 

5 addition, this vector contains the Met25 promoter which allow expression of a 
third partner in medium lacking methionine. After transformation of Y187 and 
CG1945 yeast by the pP6-HP1529 and p3H1-HP1230 vectors, respectively, both 
strains were mated. The resulting diploid strain was grown on a minimal medium 
lacking leucine and tryptophan to select for both plasmids (DO-2) and on DO-2 

10 without histidine to select for interaction (DO-3). As a positive control, this strain 
was observed to grow on the selective medium for dilutions ranging from 1 to 10' 4 
(Figure 19, lane 1). This result shows an interaction between HP1230 and 
HP1529 proteins, as previously identified using library screening (Rain ef a/., 
2001). 

is Two different plasmids were used for this study: (i) the pP6 vector which 

contain the GAL4 activation domain (AD) (Rain ef a/., 2001). One of the HP1529 
fragments (nucleotides 1-1374) obtained by screening the HP 1230 protein was 
selected and used as prey in the pP6 vector fused to GAL4 AD; (ii) the p3H1 
vector which contains the DNA-binding domain (DBD) of GAL4 and a methionine- 

20 regulated Met25 promoter (Tirade ef a/., 1997, J. Biol. Chem. 272: 22995-22999). 
The HP1230 encoding sequence of 540 bp was sub-cloned from pB1-HP1230 
into the BamHUPst\ sites of p3H1 as fusion protein with GAL4-DBD giving p3H1- 
HP1230. In addition, the WT SID1529 or SID1529* (N38D-V53L) or SID1529* 
(V53L) were sub-cloned from pP7-centro (Notl-BamHI) to the NotUBglU sites of 

25 p3H1-HP1230 under the control of the Met25 promoter. Expression from the 
Met25 promoter is obtained in the absence of methionine. As negative control, 
we used a prey encoding the HP0875 protein. All PCR fragments and in frame 
fusions were checked by sequencing. 

The pP6 and p3H1 derived-vectors were used to transform the Y187 and 
30 CG1945 yeast strains, respectively. Both strains were mated in YPD buffer 
(Yeast Peptone Dextrose; Bio 101, Inc) for 4 hours at 30 D C and the resulting 
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diploid strain was selected on a minimal medium lacking leucine and tryptophane 
(DO-2). The interaction between proteins was observed in plates containing DO- 
2 deleted in histidine (DO-3) without methionine. 

To assay whether The different versions of SID1529 can modulate this 
5 interaction, this protein was cloned in the p3H1-HP1230 vector under the control 
of the Met25 promoter. In the presence of the WT SID1529, growth was observed 
on D03-Met medium thus showing that the WT SID1529 had little or no effect on 
the HP1230-HP1529 interaction (Figure 19, lane 2). In contrast, cells transformed 
with pP6-HP1529 and p3H1-HP1230-SID1529*(N38D-V53L) or p3H1-HP1230- 
io SID1529* (V53L) were almost unable to grow on the selective medium (Figure 
19, lanes 3-4). The growth of both strains in selective medium and in the 
presence of methionine (DO-3 + Met) was not affected, thus showing that the 
effect of SID1529* (N38D-V53L) or SID1529* (V53L) is a specific inhibition of the 
HP1230-HP1529 interaction (Figure 18, lanes 3-4). Taken together, these results 
is clearly demonstrate that substitution of valine to leucine at position 53 of SID1 529 
led to complete inhibition of the HP1230-HP1529 interaction and confirm that 
SID1529 derivatived might have some potential as lead compounds to inhibit 
Helicobacter pylori growth. 

Example 11 : Modulating compounds/PIM screening 

20 The following results obtained from these Examples, as well as the 

teachings in the specification are set forth in the Tables below. 

While the invention has been described in terms of the various preferred 
embodiments, the skilled artisan will appreciate that various modifications, 
substitutions, omissions and changes may be made without departing from the 
25 scope thereof. Accordingly, it is intended that the present invention be limited by 
the scope of the following claims, including equivalents thereof. 
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Complexes of interacting proteins 
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HP0667 


HP0770 


HP0339 


HP1084 


HP0336 


HP0405 


HP1423 


HP1421 


HP0667 


HP1430 


HP0339 


HP1092 


HP0336 


HPC470 


HP1423 


HP 1529 


HPG667 


HP1571 


HP0339 


HP1156 


HP033S 


HP0476 


HP0691 


HP0692 


HP1073 


HP0552 


HP0339 


HP1207 


HP0336 


HP0540 


HP0691 


HP1362 


HP1073 


HP0973 


HP0339 


HP1241 


HP0336 


HP0592 


HP0224 


HP0224 


HP1073 


HP1309 


HP0339 


HP1333 


HP0336 


HP0605 


HP0224 


HP0919 


HP1073 


HP1409 


HP0339 


HP1373 


HP0336 


HP0621 


HP0224 


HP1374 


HP1198 


HP0088 


HP0339 


HP1513 


HP0336 


HP0626 


HP0224 


HP1409 


HP1198 


HP0268 


HP1427 


HP0025 


HP0336 


HP0660 


HP0692 


HP0197 


HP119B 


HP0293 


HP1427 


HP0116 


HP0336 


HP0665 


HP0692 


HP0584 


HP1198 


HP0452 


HP1427 


HP0229 


HP0336 


HP0680 


HP0692 


HP0887 


HP1198 


HP0705 


HP1427 


HP0289 


HP0336 


HP0701 


HP0071 


HP0288 


HP1198 


HP0775 


HP1427 


HP0317 


HP0336 


HP0717 


HP0071 


HP0378 


HP1198 


HP0965 


HP1427 


HP0373 


HP0336 


HP07B0 


HP0071 


HP1165 


HP1198 


HP1032 


HP1427 


HP0810 


HP0336 


HP0774 


HP0071 


HP1450 


HP1198 


HP1114 


HP1427 


HP0896 


HP0336 


HP0775 


HP0071 


HP1503 


HP1198 


HP1124 


HP1427 


HP0913 


HP0336 


HP0779 


HP0071 


HP1503 


HP1198 


HP1198 


HP1427 


HP0944 


HP0336 


HP0786 


HP0714 


HP011B 


HP1198 


HP1274 


HP1427 


HP1120 


HP0336 


HP0794 


HP0714 


HP0289 


HP1198 


HP1378 


HP1427 


HP1157 


HP0336 


HP0795 


HP0714 


HP0527 


HP1198 


HP1411 


HP1427 (HP1157 


HP0336 


HP0850 


HP0714 


HP0528 


HP1198 


HP1541 


HP1427 


HP1157 


HP0336 


HP0858 


HP0714 


HP0552 


HP1076 


HP0033 


HP1427 


HP1243 


HP0336 


HP0903 


HP0714 IHP0714 


HP1076 


HP0275 


HP1427 


HP1342- 


HP0336 


HP0919 


HP0714 HP0958 


HP1076 


HP0304 


HP0218 


HP0527 


HP0336 


HP0929 


HP0714 


HP1138 


HP1076 


HP0319 


HP1301 


HP0030 


HP0336 


HP0996 


HP0714 


HP1300 


HP1076 


HP0449 


HP1301 


HP0705 


HP0336 


HP1026 


HP0714 


HP1446 


HP1076 


HP0753 


HP1301 


HP0887 


HP0336 


HP1037 


HP0428 


HP0284 


HP1076 


HP0841 


HP1301 


HP1409 


HP0336 


HP1132 


HP0428 


HP0705 


HP1076 


HP 1544 


HP1542 


HP0289 


HP0336 


HP1236 


HP0428 


HP1350 


HP107B 


HP1563 
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^p 1542 


~iPD289 


HP0336 


HP1241 


HP0428 


HP1409 


HP1196 


HP 1409 


HP1 542 


-JP0289 


HP0336 


HP1266 




HP0116 


HP1196 


HP1514 




HP0601 


HP0336 


HP1274 


HP1518 


HP0705 


HP1529 


HP 1230 


1 -Bait ORF 


2-Pr9y ORF 


1 -Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 




HP0609 


HP0336 




HP1518 


HP1065 


HP0071 


HP0278 


upl!c42 


HP0610 


HP0336 


HP 1332 


HP1518 


HP14S9 


HP0071 


HP0570 J 


HP1 542 


HP0610 


HP0336 


HP1434 


HP0547 


HP0299 


HP0071 


HP0775 


2 — 


HP0740 


HP0336 


HP1452 


HP0547 


HP0601 


HP0071 


HP1340 


Hpl542 


HP1273 


HP0336 




HP0547 


HP0687 


HP0071 


HP1409 


HP0316 


HP0621 


HP0687 


HP 1503 


HP0072 


HP1489 


HP041 1 


HP0304 




HP0895 


HP0687 


HP1506 


HP07B5 


HP0764 


HP0411 


HP0464 


HP031 6 




HP0687 




HP0765 


HP1024 


HP0411 


HP1019 




HP01 07 


HP0687 


HP0687 


HP0072 


HP1489 


HP0411 


HP1326 


HP0431 


HP0547 


HP0687 


HP0759 


HP0642 


HP0642 


HP0411 


HP1453 


HP0431 


HP0705 


HP0687 


HP0942 


HP1 173 


HP0528 


HP0411 


HP1470 


HP0674 


HP0033 






HP1 173 


HP0595 


HP0887 


HP0610 


HP0674 


HP0862 


HP0687 


HP1212 


HP1 173 


HP0610 


HP0887 


HP0887 


HP0553 




HP0687 


HP1510 


HP1 173 


HP0816 


HP0887 


HP1157 




HP0312 


HP0525 


HP0389 


HP1 173 


HP1553 


HP0887 


HP1464 


HP1 077 


HP0304 


HP0525 


HP0513 


HP 12 93 


HP1 198 


HP1198 


HP1032 


HP1077 




HP0525 


HP0525 


HP1051 


HP0435 


HP1198 


HP1218 


HP0798 


HP0798 


HP0525 


HP0550 


HP1051 


HP1513 


HP 1065 


HPD918 


HP0436 


HP0010 




HP1409 


HP1291 


HP0278 


HP1198 


HP0264 


HP0436 


HP0072 


HP0525 


HP141 1 


HP1291 


HP1283 


HP 11 98 


HP0371 


HP0436 


HP0858 


HP0525 


HP1451 


HP1291 


HP1409 


HP1198 


HP0615 


HP0436 


HP0887 


HP0887 


HP0459 


HP1291 


HP1553 


HP1198 


HP0655 


HP0436 


HP1019 


HP0887 


HP0610 


HP0084 


HP0392 


HP1198 |HP0793 


HP0436 




HP0887 


HP0S99 


HP0084 


HP0950 


HP1198 


HP1002 


HP0071 


HP1409 


HP0887 


HP0887 


HP0134 


HP1012 


HP1198 


HP1048 


HP0071 




HP0887 




HP0325 


HP0246 


HP1198 


HP1069 


HP1402 


HP0420 


HP0887 


HP1460 


HP0325 


HP0278 


HP0868 


HP0088 


HP1402 


HP0621 


HP0887 


HP1464 


HP0325 )HP0338 


HP0868 


HP0327 


HP1402 


HP0657 


HP0088 


HP0011 


HP0325 


HP0399 


HP0868 


HP0869 


HP1402 




HP0088 


HP0088 


HP0325 


HP0449 


HP0868 


HP1142 




^jp^Qg 


HP0088 


HP0221 


HP0325 


HP1321 


HP0748 


HP0048 


^pQyyg " 


HP0033 


HP0088 


HP0437 


HP0325 


HP1409 


HP0748 


HP0100 


HpQ775 


HP0056 


HP0088 


HP0464 


HP0419 


HP0321 


HP0748 


HP0695 


Hp ^ 7 _; 5 


HP0231 


HP0088 


HP0792 


HP0419 


HP0419 


HP0748 


HP0736 


HpQ775 


HP0285 


HP0088 


HP0974 


HP0419 


HP0862 


HP0748 


HP0748 


Hp0775 


HP0378 


HP0088 


HP0988 


HP0893 


HP0862 


HP0289 


HP0289 


HpQ775 


HP0621 


HP0088 


HP1045 


HP0410 


HP1080 


HP0289 


HP0289 


HP0775 


HP0645 


HP0088 


HP1096 


HP0410 


HP1198 


HP0289 


HP0610 


HP0775 


HP0705 


HP0088 


HP1198 


HP1067 


HP0392 


HP0289 


HP0887 


HP0775 


HP0775 


HP0088 


HP1535 


HP0650 


HP0012 


HP0289 


HP0922 


HP0775 


HP0792 


HP1298 


HP0012 


HP0650 


HP0272 


HP0289 


HP1038 


HP0775 


HP0965 


HP1298 


HP0278 


HP0650 


HP0317 


HP0289 


HP1542 


HP0775 


HP1063 


HP1298 


HP0449 


HP0650 


HP1026 


HP0289 


HP0289 


HP0775 


HP1274 


HP1298 


HP0522 


HP0650 


HP1245 


HP0289 


HP0289 


HP0775 


HP1355 


HP1298 


HP0528 


HP0650 


HP1547 


HP0289 


HP0610 


HP0775 


HP1409 


HP1298 


HP0696 


HP0776 


HP0067 


HP0289 


HP1355 


HP1489 


HP1489 


HP1298 


HP0705 


HP0776 


HP0278 


HP0383 


HP0547 



WO 02/066501 



76 



PCT/EP01/15428 



HP1086 


HP0019 




HP0770 


HP077G 


HP1378 


HP0507 


HP0172 


HP1086 




HP1 298 


nrUoO ( 


HP0776 


HP1409 


HP0507 


HP1409 


HP1 086 


HP?409 


HP1 298 


HP0950 


HP0897 


HP0422 


HP0507 


HP1489 


HP0071 


HP01 89 


HP1 298 


HP1379 


HP0897 


HP1362 


^p^53 


HP1 158 


HP0071 


HP1072 


HP1 298 


HP1403 


HP0895 






HP1329 


1-Bait ORF 


O DfAW ADC 


1 Rait ADC 

1 -bait UKr 


hpoov?^^' 


1 -Bait ORF 


2-Prey ORF 


1 -Bait ORF 


2-Prey ORF 










nruoyo 


HP0426 




HP1409 




H'Pll86 


HP0072 


HP0073 


HP0895 


HP0894 


HP1 1 58 




HP0071 


nr ! u\io 


HP0072 


HPQ1 56 


HP0895 


HP0895 


HP0068 


HP0070 


nrUDO / 


HP0705 


HPQ072 


HPQ21 6 


HP0895 


HP1409 


HP0068 


HP01 18 


HP0687 


nrUooy 


HP0072 


HP0437 


HP0653 


HP0247 


HP0069 


HP0067 


HP0687 


HP1 072 


HP0072 


H P06 1 0 


HP0653 


HP0653 


HP0066 


HP0066 


HP0687 


^p^y 


HP0072 


rirUOoU 


tlrUOOj 


HP1 01 2 


HP01 87 


HP0020 


HP0687 




HP0072 




HP0653 


HP1409 


HP0187 


HP0231 




HP0254 


HP1 033 


UPni7Q 

nrUj/a 


HP1 238 


HP1488 


HP0870 


HP0748 


HP0187 


riru/ uo 


HP0064 


HP0063 


HP 1444 


HP001 0 


HP0870 


HP1157 


HP0187 


HP0862 


HP 1 032 


nrUD4o 


HP"! 444 


nruuoo 


HP0870 


HP1243 


HP0187 


HP1414 




HP081 8 


HP 1 444 


nr uuyy 


HP0199 


HP0278 


HP0187 


HP1513 


— 


/jp^ao 


up^f; 




HP0199 


HP0356 


HP0958 


HP0353 


HP1 032 






HP0760 


HP0199 


HP0449 


HP0958 


HP0522 


HP1 032 


Hp13-)6 


■■ 44 




HP0199 


HP0891 


HP0958 


HP0582 


HP0062 


nrUUO<i 


■■■ 


HP1 332 


HP0199 


HP1198 


HP0958 


HP0714 




unn -I no 
nrU 1 Uo 


upTtta 


HP0347 


HP0199 


HP1313 


HP0958 


HP0884 


HP0062 




HpiQ ^ 4 


uprisen 
nruoyo 


HPC199 |HP1314 


HP0958 


HP1428 




HP^I 56 


H p -j o 74 


HP1278 


HP0875 


HP0874 


HP0958 IHP1462 




.-— — — 






HP0510 


HP0347 


HP0067 


HP0069 


HP01 84 


HPQ , 23 





Hp 1 399 


HP0510 


HP0510 


HP0145 


HP0116 




HP0238 


HP1 074 




HP1464 


HP0132 


HP0145 


HP0281 


HP01 84 


HP0354 


HP1074 


HP1497 


HP1464 


HP0740 


HP0145 


HP0468 


HP01 84 


HP0609 


HP0515 


HP1409 


HP1464 


HP1198 


HP0145 


HP0887 


HP01 84 


HP0655 


HP0515 


HP1537 


HP1464 


HP1464 


HP0145 


HP1274 


HP01 84 


HP0705 


HP0515 


HP1561 


HP1464 


HP1510 


HP0620 


HP0620 


HP0184 


HP0793 


HP 1444 


HP0088 


HP0070 


HP0068 


HP0620 


HP1409 


HP01 84 


HP1409 


HP1444' 


HP0392 


HP0070 


HP0070 


HP0595 ' 


HP0025 


HP0879 


HP0033 


HP 1444 


HP0422 


HP0377 


HP0033 


HP0595 


HP0060 


HP0879 


nruuDo 


HP1444 


HP0655 


HP0377 


HP0145 


HP0595 


HP0595 


UIDnP7Q 

tiruo r y 


HP0252 


HP1444 


HP1332 


HP0377 


HP0183 


HP0595 IHP0870 


HP0879 


HP0401 


HP1444 HP1493 


HP0377 


HP0194 


HP0595 (HP1052 


HP0879 


nrUnUi 


HP1238 HP1238 


HP0377 


HP0224 


HP0595 


HP1430 


HP0879 





HP1048 HP1198 


HP0377 


HP0285 


HP0621 


HP0609 




HP0o26 


HP0990 


HP0085 


HP0377 


HP0399 


HP0621 


HP0610 


HP0879 


HP0649 


HP0990 


HP0116 


HP0377 


HP0486 


HP0621 


HP0862 


HP0879 


HP0703 


HP0990 


HP0321 


HP0377 


HP0854 


HP1150 


HP1451 


HP0879 


HP1248 


HP0990 


HP0663 


HP0377 


HP1409 


HP0181 


HP0807 


HP0879 


HP1274 


HP0990 


HP0727 


HP0075 


HP0075 


HP1391 


HP1041 


HP0879 


HP1379 


HP0990 


HP0909 


HP0073 


HP0067 


HP0061 


HP0066 


HP0879 


HP1422 


HP0990 


HP1213 


HP0073 


HP0232 


HP0061 


HP0978 


HP0879 


HP0012 


HP0990 


HP1327 


HP0073 


HP0705 


HP0061 


HP1409 


HP0879 


HP0116 


HP0990 


HP1333 


HP0762 


HP0272 


HP1390 


HP0118 


HP0879 


HP0178 


HP0990 


HP1345 


HP0762 


HP0506 


HP1390 


HP0120 


HP0879 


HP0275 


HP104S 


HP0162 


HP0762 


HP0655 
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HP1390 


HP0154 


HP0879 


HP0281 lHP-;045 |HP0182 


HP0762 


HP1409 


HP1390 


HP0325 


HP0879 


HP0321 


HP1046 IHP0220 


HP0156 


HP0156 


HP1390 


HP0351 


HP0879 


HP0483 


HP 1046 |HP0509 


HP0452 


HP0109 


HP1390 


HP0528 


HP0879 


HP0578 


HP1046 


HP0583 


HP0452 


HP0155 


HP1390 


HP0751 


HP0879 


HP0639 


HP1046 


HP0746 


HP0452 


HP0278 


HP1390 


HP0809 


HP0879 


HP0666 


HP1046 


HP0750 


HP0452 


HP0705 


HP1390 


HP0816 


HP0879 


HP0699 


HP1046 


HP0760 


HP0452 


HP1282 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


HP1390 


HP1012 


HP0879 


HP0735 


lHP/1046 


HP 1024 


HP0452 


HP1409 


HP1390 


HP1081 


HP0879 


HP0821 


HP1046 


HP1197 


HP1072 


HP0096 


HP1390 


HP1188 


HP0879 


HP1012 


HP1046 


HP1312 


HP1072 


HP0792 


HP1390 


HP1379 


HP0879 


HP1160 


HP 1046 


HP 1464 


HP1072 


HP1321 


HP1390 


HP1486 


HP0879 


HP1299 


HP1046 


HP1488 


HP1072 


HP1409 


HP1390 


HP1513 


HP0879 


HP1374 


HP1045 


HP0325 


HP1072 


HP0050 


HP0838 


HP1409 


HP0879 


HP1459 


HP1045 


HP0750 


HP1072 


HP0231 


HP1033 


HP0599 


HP0879 


HP1513 


HP1045 


HP1205 


HP1072 


HP0263 


HP1033 


HP1513 


HP0879 


HP1567 


HP0870 


HP0317 


HP1072 


HP0705 


HP1033 


HP0651 


HP1238 


HP1238 


HP0870 


HP0488 ' 


HP1072 


HP0836 


HP1033 


HP1105 


HP1238 


HP0655 


HP0870 


HP0595 


HP1072 


HP1430 


HP0602 


HP1402 


HP1015 


HP1521 


HP0962 


HP0617 


HP1255 


HP0687 


HP0608 


HP0175 


HP0167 


HP1035 


HP0962 


HP0621 


HP1255 


HP0969 


HP0608 


HP0589 


HP1014 


HP1014 


HP0962 


HP0655 


HP1255 


HP0969 


HP0608 


HP0526 


HP1014 


HP1409 


HP0962 


HP0750 


HP1255 


HP1072 


HP0608 


HP0919 


HP0047 


HP0047 


HP0962 


HP0996 


HP1255 


HP1077 


HP0232 


HP0605 


HP0047 


HP0048 


HP0962 


HP1019 


HP1255 


HP1114 


HP0849 


HP0056 


HP0047 


HP0695 


HP0962 


HP1024 


HP1255 


HP1290 


HP0849 


HP0066 


HP1497 


HP 1074 . 


HP0962 


HP<025 


HP1255 


HP1550 


HP0849 


HP0109 


HP1496 


HP0769 


HP0962 


HP1188 


HP0978 


HP0979 


HP0849 


HP0116 


HP0045 


HP0045 


HP0962 


HP1274 


HP0978 


HP1583 


HP0849 


HP0132 


HP0045 


HP 1 409 


HP0962 


HP1353 


HP0753 


HP0030 


HP0849 


HP0197 


HP0601 


HP0547 


HP0962 


HP1379 


HP0753 


HP0033 


HP0849 


HP0213 


HP0501 


HP0753 


HP0952 


HP1409 


HP0753 


HP0115 


HP0849 


HP0354 


HP0601 


HP1154 


HP0962 


HP1478 


HP0753 


HP0197 


HP0849 


HP0370 


HP0501 


HP 1377 


HP0194 


HP0592 


HP0753 


HP0492 


HP0849 ' 


HP0373 


HP0601 


HP1409 


HP0194 


HP1377 


HP0753 


HP0747 


HP0849 


HP0417 


HP1259 


HP0006 


HP0194 


HP1488 


HP0753 


HP1035 


HP0849 


HP0452 


HP1259 


HP0025 


HP0194 


HP1489 


HP0753 


HP1098 


HP0849 


HP0466 


HP1259 


HP0025 


HP0042 


HP0017 


HP0753 


HP1274 


HP0849 


HP0476 


HP1259 


HP0026 


HP0042 


HP0051 


HP0753 


HP1454 


HP0849 


HP0512 


HP1259 


HP0033 


HP0042 


HP0054 


HP0753 


HP1485 


HP0849 


HP0539 


HP1259 


HP0212 


HP0042 


HP0278 


HP0753 


HP1523 


HP0849 


HP0569 


HP1259 


HP0231 


HP0042 


HP0381 


HP0979 


HP0977 


HP0849 


HP0570 


HP1259 


HP0252 


HP0042 


HP0419 


HP0979 


HP0979 


HP0849 


HP0583 


HP1259 


HP0280 


HP0042 


HP0723 


HP0979 


HP1278 


HP0849 


HP0593 


HP1259 


HP0317 


HP0042 


HP1133 


HP0614 


HP0614 


HP0849 


HP0S18 


HP1259 


HP0354 


HP0042 


HP1318 


HP0739 


HP0016 


HP0849 


HP0621 


HP1259 


HP0405 


HP0042 


HP1392 


HP0739 


HP0379 


HP0849 


HP0717 


HP1259 


HP0585 


HP1493 


HP0033 


HP0739 


HP0453 


HP0849 


HP0760 


HP1259 


HP0615 


HP1493 


HP0043 


HP0739 


HP0527 


HP0849 


HP0971 


HP1259 


HP0643 


HP1493 


HP0051 


HP0739 


HP0547 


HP0849 


HP1037 


HP 1259 


HP0705 


HP1493 


HP0116 


HP0739 


HP0621 
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HP0849 


HP1041 


HP1259 


HP0779 


HP1493 


HP0281 


HP0739 


HP0655 


HP0849 


HP1048 


HP1259 


HP0792 


HP1493 


HP0435 


HP0739 


HP0687 


HP0849 


HP1069 


HP1259 


HP0830 


HP1493 


HP0483 


HP0739 


HP0810 


HP0849 


HP1084 


HP1259 


HP0862 


HP1493 


HP0501 


HP0739 


HP1056 


HP0849 


HP1129 


HP1259 


HP0965 


HP1493 


HP0669 


HP0739 


HP1157 


HP0849 


HP1132 


HP1259 


HP1012 


HP1493 


HP0841 


HP0739 


HP1279 


HP0849 


HP1213 


HP1259 


HP1232 


HP1493 


HP0977 


HP0739 


HP1357 


HP0849 


HP1299 


HP1259 


HP1259 


HP1493 


HP1002 


HP1025 


HP0668 


HP0849 


HP 1332 


HP1259 


HP 1299 


HP1493 


HP1025 


HP1025 


HP1025 


1-BaitORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


HP0849 


HP 1374 


HP1259 


HP1342 


HP1493 


HP1138 


HP1025 


HP1114 


HP0849 


HP1379 


HP1259 


HP1411 


HP1493 


HP1198 


HP1145 


HP0182 


HP0849 


HP1393 


HP1259 


HP 1422 


HP1493 


HP1350 


HP1145 


HP0289 


HP0849 


HP1400 


HP1259 


HP1430 


HP1493 


HP1411 


HP1145 


HP0400 


HP0849 


HP1452 


HP0S62 


HP0030 


HP1493 


HP1533 


HP1145 


HP0550 


HP0849 


HP1489 


HP0962 


HP0088 


HP0862 


HP0607 


HP1145 


HP0621 


HP0849 


HP1493 


HP0962 


HP0111 


HP0862 


HP0621 


HP1145 


HP0669 


HP0849 


HP1553 


HP0962 


HP0244 


HP0862 


HP0791 ' 


HP1145 


HP0690 


HP1015 


HP0299 


HP0962 


HP0285 


HP0862 


HP1474 


HP1145 


HP0705 


HP1015 


HP0547 


HP0962 


HP0410 


HP0402 


HP0825 


HP1145 


HP1010 


HP1015 


HP0607 


HP0962 


HP0426 


HP0402 


HP1130 


HP0177 


HP1409 


HP1015 


HP0687 


HP0962 


HP0490 


HP0402 


HP1234 


HP0177 


HP1508 


HP1015 


HP1435 


HP0952 


HP0528 


HP0402 


HP1478 


HP0177 


HP1550 


HP1015 


HP1490 


HP09S2 


HP0537 


HP1255 


HP0607 


HP0291 


HP0279 


HP0291 


HP0291 


HP1114 


HP0919 


HP1236 


HP1148 


HP0016 


HP0791 


HP0291 


HP1409 


HP0481 


HP0281 


HP1236 


HP1263 


HP0016 


HP0969 


HP0291 


HP1541 


[HP0481 


HP0298 


HP0909 


HP0587 


HP0016 


HP1017 


HP0291 


HP1550 


HP0481 


HP0416 


HP0909 


HP0607 


HP0016 


HP 1183 


HP0874 


HP0875 


HP0481 


HP0421 


HP0909 


HP1081 


HP0016 


HP1257 


HP1382 


HP0025 


HP0481 


HP0494 


HP0909 


HP1521 


HP0016 


HP1450 


HP1382 


HP0182 


HP0481 


HP0645 


HP0909 


HP1561 


HP0016 


HP1490 


HP1382 


HP0269 


HP0481 


HP0736 


HP1478 


HP1478 


HP1559 


HP0010 


HP1382 


HP0317 


HP0481 


HP0741 


HP1231 


HP1247 


HP1559 


HP0056 


HP 1382 


HP0705 


HP0481 


HP0779 


HP0262 


HP0355 


HP1559 


HP0087 


HP1382 


HP0896 


HP0481 


HP0971 


HP0262 


HP1318 


HP1559 


HP0116 


HP1382 


HP1142 


HP0481 


HP1000 


HP1230 


HP1230 


HP1559 


HP0116 


HP1382 


HP1182 


HP0481 


HP1025 


HP1230 


HP1529 


HP1559 


HP0116 


HP1382 


HP1237 


HP0481 


HP1266 


HP0263 


HP0056 


HP1559 


HP0120 


HP1382 


HP1243 


HP0481 


HP1406 


HP0263 


HP0486 


HP1559 


HP0132 


HP1382 


HP1342 


HP0481 


HP1435 


HP0263 


HP0605 


HP1559 


HP0145 


HP0150 


HP1037 


HP0481 


HP1513 


HP0263 


HP0779 


HP1559 


HP0355 


HP0150 


HP1Q67 


HP1118 


HP0822 


HP0263 


HP0792 


HP1559 


HP0364 


HP0150 


HP1198 


HP0149 


HP0103 


HP0263 


HP1238 


HP1559 


HP0379 


HPO150 


HP1371 


HP0149 


HP0196 


HP0263 


HP1325 


HP1559 


HP0406 


HP0150 


HP1409 


HP0149 


HP0331 


HP0263 


HP1332 


HP1559 


HP0466 


HP0150 


HP0449 


HP0149 


HP0527 


HP0263 


HP1489 


HP1559 


HP0488 


HP0294 


HP0100 


HP0149 


HP0547 


HP0263 


HP1513 


HP1559 


HP0589 


HP0294 


HP0289 


HP0149 


HP0693 


HP0224 


HP0250 


HP1559 


HP0606 


HP0294 


HP0377 


HP0149 


HP0775 


HP0224 


HP0920 


HP1559 


HP0615 


HP0294 


HP0399 


HP0149 


HP1034 


HP0224 


HP1076 


HP1559 


HP0621 


HP0294 


HP1318 


HP0149 


HP1079 


HP0224 


HP1198 


HP1559 


HP0660 
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HP0294 


HP1409 


HP0149 


HP1115 


HP0109 


HP0101 ' 


HP1559 


HP0663 


HP0822 


HP0651 


HP0149 


HP1248 


HP0109 


HP0488 


HP1559 


HP0669 


HP0822 


HP0725 


HP0149 


HP1332 


HP0261 


HP0281 


HP1559 


HP0680 


HP0822 


HP0822 


HP0028 


HP0214 


HP02B1 


HP0537 


HP1559 


HP0710 


HP0944 


HP0033 


HP0028 


HP0354 


HP0261 


HP0583 


HP1559 


HP0780 


HP0944 


HP0224 


HP0028 


HP0527 


HP0261 


HP0605 


HP1559 


HP0795 


HP0944 


HP0695 


HP0028 


HP0547 


HP0261 


HP0632 


HP1559 


HP0817 


HP0944 


HP0944 


HP0028 


HP0607 


HP0261 


HP0655 


HP1559 


HP0858 


HP0944 


HP1318 


HP0028 


HP0791 


HP0261 


HP0750 


HP1559 


HP0929 


HP0944 


HP1529 


HP0028 


HP0969 


HP0261 


HP0887 


HP1559 


HP1026 


HP0823 


HP0100 


HP0028 


HP1041 


HP0261 


HP1019 


HP1559 


HP1041 


1-BaitORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 


2-Prey ORF 


HP0823 


HP0255 


HP0028 


HP1072 


HP0261 


HP1124 


HP1559 


HP1075 


HP0823 


HP0352 


HP0028 


HP1222 


HP0261 


HP1274 


HP1559 


HP1 116 


HP0823 


HP0488 


HP0028 


HP1450 


HP0261 


HP1350 


HP1559 


HP1 143 


HP0823 


HP0919 


HP0028 


HP1470 


HP0261 


HP1489 


HP1559 


HP1187 


HP0823 


HP0969 


HP0028 


HP1503 


HP0261 


HP1513 


HP1559 


HP1207 


HP0823 


HP1024 


HP0028 


HP1550 


HP0261 


HP1517 


HP1559 


HP1236 


HP0823 


HP1116 


HP0821 


HP0705 


HP0261 


HP1543 


HP1559 


HP1266 


HP0823 


HP1409 


HP0821 


HP0821 


HP0902 


HP0588 


HP1559 


HP1274 


HP0947 


HP0096 


HP0821 


HP1114 


HP0902 


HP0887 


HP1559 


HP1325 


HP0947 


HP0116 


HP0821 


HP1409 


HP0902 


HP1409 


HP1559 


HP1393 


HP0947 


HP0569 


HP0389 


HP0696 


HP0016 


HP0C16 


HP1559 


HP1434 


HP0947 


HP0621 


HP0389 


HP1262 


HP0016 


HP0038 


HP1559 


HP1452 


HP0947 


HP1012 


HP0389 


HP1400 


HP0016 


HP0228 


HP1559 


HP1453 


HP0947 


HP1205 


HP02S8 


HP1198 


HP0016 


HP0439 


HP1559 


HP1470 


HP0947 


HP1373 


HP1236 


HP0026 


HP0016 


HP0687 


HP1559 


HP1472 


HP1114 


HP0651 


HP1236 


HP0347 


HP0016 


HP0697 


HP1559 


HP1479 


HP1114 


HP0705 IHP1236 


HP0355 IHP0016 HP0759 


HP1559 


HP1489 


HP1559 


HP1493 


HP1246 


HP0326 [HP1489 |HP1489 


HP0806 


HP0134 


HP0108 


HP1399 


HP1246 


HP0407 >HP0391 


HP0099 


HP0928 


HP0928 


HP0108 


HP1541 


HP 1246 


HP0886 


HP0391 


HP0392 


HP0928 


HP1409 


HP0909 


HP0996 


HP1246 


HP1035 


HP1000 


HP1409 


HP1204 


HP0514 


HP0909 


HP1041 


HP1246 


HP 1244 


HP0031 


HP0006 


HP1324 


HP0010 


HP0909 


HP1081 


HP1246 


HP1460 


HP0031 


HP0056 


HP0115 


HP0317 


HP0909 


HP1241 


HP0035 


HPQ035 


HP0031 


HP0281 


HP0115 


HP0753 


HP0909 


HP1256 


HP1245 


HP0650 


HP0031 


HP1041 


HP0115 


HP1157 


HP0909 


HP1321 


HP1245 


HP0717 


HP0031 


HP1513 


HP1334 


HP1329 


HP1312 


HP0083 


HP1245 


HP0816 


HP0031 


HP1567 


HP1334 


HP1409 


HP1312 


HP0282 


HP1245 


HP1024 


HP0705 


HP0254 


HP0002 


HP0002 


HP1312 


HP0413 


HP1245 


HP1523 


HP0705 


HP1406 


HP0002 


HP1200 


HP1312 


HP0705 


HP0399 


HP0399 


HP0705 


HP1541 


HP0002 


HP1409 


HP1312 


HP1046 


HP0399 


HP0621 


HP0032 


HP0032 


HP0002 


HP1451 


HP1312 


HP1316 


HP0399 


HP1048 


HP0274 


HP00B6 


HP0124 


HP0377 


HP1312 


HP1409 


HP0398 


HP1430 


HP0274 


HP0518 


HP0124 


HP1409 


HP1312 


HP1430 


HP1244 


HP0857 


HP0274 


HP0951 


HP0242 


HP0242 


HP1312 


HP1523 


HP1244 


HP1246 


HP0274 


HP1294 


HP0001 


HP0050 


HP0954 


HP0045 


HP0276 


HP1263 


HP1482 


HP0102 


HP0001 


HP0355 


HP0954 


HP0466 


HP0276 


HP0012 


HP1482 


HP0259 


HP0001 


HP0792 


HP0712 


HP0152 


HP0276 


HP1489 


HP1482 


HP0466 


HP0243 


HP1409 


HP0712 


HP0197 


HP0340 


HP001 1 


HP1482 


HP0489 


HP1338 


HP1298 
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HP0712 


HP0278 


HP0340 IHP0096 


HP1 482 


HP0645 


HP1338 


HP1338 


HP0712 


HP0281 


HP0340 HP0231 




HP0760 


HP1338 


HP1409 


HP0712 


HP0662 


HP034Q HPQ34G 


"HP1482 


_ HP1V15 


HP1338 


HP1523 


HP0712 


HP0714 


HP0340 HPQ695 


HP1 482 


HP1369 


HP0249 


HP0012 


HP0712 


HP0736 


HP0340 'HP0730 


HP0073 


HP0073 


HP0249 


HP0099 


HP0712 


HP0836 


HP0340 HP0821 


nrUU / o 


HP0232 


HP0249 


HP0174 


HP0712 


HP0850 


HP0340 HP0834 


nrUU / J 


HP0259 


HP0249 


HP0214 


HP0712 


HP1 160 


HP0340 HP1088 


HP1449 


HP061 5 


HP0249 


HP0415 


HP0712 


HP1470 


HP0340 HP1138 


HP1449 


HP1019 


HP0249 


HP0527 


HP0712 


HP1489 


HP0340 HPV98 


HP1 449 


HP 1332 


HP0249 


HP0547 


HP0712 


HP1513 


HP0340 HP1274 




HP1068 


HP0249 


HP0610 


HP0100 


HP0123 


HP0340 HP1362 


HP1 322 


HP001 3 


HP0249 


HP0787 


HP0100 


HP0132 


HP0950 HP0030 




HP0649 


HP0249 


HP0922 


1-Bait ORF 


2-Prey ORF 


1-Bait ORF 2-Prey ORF 


i -Dan UKr 


2-Prey ORF 


1-Bait ORF 


2-Prey O 


HP0100 


HP0278 


HP0950 [HP0116 


HP1 322 


jlp 1 ^? 88 


HP0249 


HP0969 


HP0100 


HP0585 


HP0950 HP0192 




41 


HP0249 


HP1041 


HP0100 


HP0617 


HP0950 HP0304 


HP1 200 


HP1037 


HP0249 


HP1154 


HP0100 


HP0707 


HP0950 HP0394 




HP1 1 99 


HP0249 


HP1157 


HP0100 


HP0710 


HP0950 HP0884 


HP1 563 


HP0785 


HP0249 


HP1377 


HP0100 


HP0726 


HP0950 HP1025 


HP1 563 


HP1080 


HP0249 


HP1409 


HP0100 


HP0830 


HP0950 HP1198 


nruouu 


HP0433 


HP0249 


HP1487 


HP0100 


HP0852 


HP0950 HP1438 


HP0800 


HP0687 


HP0249 


HP1503 


HP0100 


HP0898 


HP1247 HP1231 


HP0800 


HP0800 


HP0249 


HP1543 


HP0100 


HP0919 


HP1247 HP1353 


HP0800 


HP0801 


HP0591 


HP0582 


HP0100 


HP1050 


HP0951 HP0116 


t-lpnftnn 
nrUOUU 


H p ° 924 


HP0591 


HP0610 


HP0100 


HP1157 


HP0951 HP1409 


nrUOUU 


HP :z67 


HP0591 


HP0663 


HP0100 


HP1379 


HP0951 HP1411 


nrUOUU 


HP1460 


HP0368 


HP0409 


HP0100 


HP141 1 


HP1489 HP0001 


nrUOOo 


HP0353 


HP0368 


HP0435 


HP0100 


HP1458 


HP1489 HP0351 




HP0406 


HP0368 


HP0897 


HP0105 


HP0105 


HP1489 HP0540 






HP0368 


HP1293 


HP0105 


HP1409 


HP1489 HP0705 


HP0801 


HP0^52 


4P0368 


HP1356 


HP0837 


HP0705 


HP1489 HP0797 


HP0801 


nrUOUU 


HP0368 


HP1410 


HP0837 


HP0836 


HP1489 HP0841 


-jpriRfu 
nruou 1 




HP0005 


HP0088 


HP1246 


HP0121 


HP1489 |hP0862 


-JpQgQg 


HP1 1 98 


HP0005 


HP01 16 


HP0005 ' 


HP0175 


HP1537 HP1189 


" I P 1414 


" IP ° 522 


HP0136 


HP0983 


HP0005 


HP0278 


HP1537 |HP1245 




HP0655 


HP0136 


HP1 119 


HP0005 


HP0317 


HP1537 


HP1325 




^ 


~ P1124 


HP0136 


HP1195 


HP0005 


HP0583 


HP1537 


HP1430 




" IP1024 


HP0136 


HP1205 


HP0005 


HP0615 


HP1537 


HP1489 




HP1413 


4P1409 


HP0136 


HP1241 


HP0005 


HP0640 


HP1537 


HP1541 




HP1413 


HP0136 


HP1393 


HP0005 


HP0657 


HP1286 


HP0231 


HP1533 


HP0010 


HP0136 


HP1430 


HP0005 


HP0665 


HP1286 


HP0625 


HP1533 


HP0033 


HP0136 




HP0005 


HP0714 


HP1286 


HP0833 


HP1533 


HP0351 


HP0136 


HP1553 


HP0005 


HP0977 


HP1286 


HP0934 


HP1533 


HP0381 


HP0379 


HP0706 


HP0005 


4P1274 


HP1286 


HP1119 


HP1533 


HP0402 


^0137 


-IP0010 


HP0005 


HP1350 Y 


HP1286 


HP1222 


HP1533 


HP0944 


HP0137 


-IP0066 


HP0005 I- 


HP1409 I- 


4P1286 


HP1379 Y 


HP1533 


HP1430 


HP0137 


^0109 


HP0005 Y 


-1P1411 I- 


HP1286 


HP 1492 Y 


HP1224 Y 


HP0278 \ 


HP0137 


HP0197 


HP0247 Y 


iP0117 Y 


1P1411 


HP0033 Y 


^P1224 I- 


iP0781 Y 


4P0137 Y 


iP0244 


HP0247 Y 


1P0247 Y 


-IP1411 


HP0351 Y 


4P1224 I- 


iP0976 Y 


4P0137 Y 


■1P0247 


HP0247 Y 


-1P1316 Y 


4P1411 


HP0397 Y 


iP1224 Y 


1P1409 Y 


1P0137 Y 


iP0289 
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CLAIMS 

What is claimed is: 

1 . A complex of protein-protein interactions in Helicobacter pylori as defined 
in Table 1 and Table 8. 

2. A complex of polynucleotides in Helicobacter pylori encoding for the 
polypeptides defined in Table 1 and Table 8. 

3. A recombinant host cell expressing the interacting polypeptides in 
Helicobacter pylori defined in Table 1 and Table 8. 

4. A method for selecting a modulating compound in Helicobacter pylori 
comprising: 

(a) cultivating a recombinant host cell on a selective medium containing a 
modulating compound and a reporter gene the expression of which is 
toxic for said recombinant host cell wherein said recombinant host cell 
is transformed with two vectors: 

(i) wherein said first vector comprises a polynucleotide 
encoding a first hybrid polypeptide and a DNA bonding 
domain; 

(ii) wherein said second vector comprises a polynucleotide 
encoding a second hybrid polypeptide and an activating 
domain that activates said toxic reporter gene when the 
first and second hybrid polypeptides interact; 

(b) selecting said modulating compound which inhibits the growth of said 
recombinant host cell. 

5. A modulating compound obtained from the method of Claim 4. 

6. A SID® polypeptide in Helicobacter pylori comprising the even SEQ ID 
Nos. 2 to 3256 in column 3 of Table 2, even SEQ ID Nos. 6590 to 6594 in 
Table 7 and even SEQ ID Nos. 6596 to 6644 in Table 8. 

7. A SID® polynucleotide in Helicobacter pylori comprising the uneven SEQ 
ID Nos. 1 to 3255 in column 2 of Table 2, uneven SEQ ID Nos. 6589 to 
6593 in Table 7 and uneven SEQ ID Nos. 6595 to 6643 in Table 8. 

8. A vector comprising the SID® polynucleotide in Helicobacter pylori 
comprising the uneven SEQ ID Nos. 1 to 3255 in column 2 of Table 2, 
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uneven SEQ ID Nos. 6589 to 6593 in Table 7 and uneven SEQ ID Nos. 
6595 to 6643 in Table 8. 

9. A fragment of said SID® polypeptide according to Claim 6. 

10. A variant of said SID® polypeptide according to Claim 6. 

1 1 .A fragment of said SID® polynucleotide according to Claim 7. 

12. A variant of said SID® polynucleotide according to Claim 7. 

13. A vector comprising the SID® polypeptide according to Claim 11. 

14. A recombinant host cell containing the vectors according to Claim 8. 

15. A pharmaceutical composition comprising a modulating compound of claim 
5 and a pharmaceutical^ acceptable carrier. 

16. A pharmaceutical composition comprising a SID® polypeptide of claim 6, 
and a pharmaceutical^ acceptable carrier. 

17. A pharmaceutical composition comprising the recombinant host cells of 
claim 14 and a pharmaceutical^ acceptable carrier. 

18. A protein chip comprising the polypeptides of claim 6. 
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1/20 




Oligo 160 



|gagagtagtaacaaaggtc| AAAGACAGTTGACTGTATCGCCG GAA TTT AT 

gfil SmaI ^ _BamH][_ gall P stJ 

G G CC ATG G AG GCC CCG GGG ATC CGT CGA CCT GCA GCC 
Ncol 

Oligo 161 

AAG CTA ATT |ccgggcgaatttcttat^ 

Oligo 160 5' GAGAGTAGTAACAAAGGTC 3' 

Oligo 161 5' CATAAGAAATTCGCCCGG3' FIGURE 1 
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pB5 



287 BfrBI 
287 Nsi\ 
287 Ppu10\ 



Alias: pAS2DDNSl 
Application: 2HY (bait) 
Backbone : pAS2DD 
Specificity: Sfi non-oriented 
Selection : ampicillin 




Spa I 4774 

• Eag \ 4767 
Not I 4766 
Sac II 4764 
A#u113l 4764 
■Sffl 47S3 
Sty I 4749 
/tool 4749 
Afccl 4746 
Hpa\ 4570 



2986 /V30MIV 
3289 Nrvl 
3293 Ecf 13511 



3384 Af/trl 
34S5 P/wMI 
3515 flfeAl 



Oligo 160 

| gagagtagtaacaaaggtc| AAAGACAGTTGACTGTATCGCCG GAA TTT ATG 



Spel 



GCCATGGCC GCA GGG GCC GCG GCC GC A CTA GTG GGG ATC C 
Ncol NotI 

£22* Siil Pstl 

TT AAT[TAA| GGG CCA CTGGGG CCC CTC GAC CTG CAG CCA 
Pad 

Oligo 161 



AGC TAA TT jccgggcgaatttottatfj 

Oligo 160 5' GAGAGTAGTAACAAAGGTC 3 ' 

O]igo 161 5' CATaAGAAATTCGCCCGG 3 ' figure 2 
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Oligo 160 

| gagagtagtaacaaaggtc| AAAGACAGTTGACTGTATCGCCG GAA TTT ATG 



G CC ATG G CC GGA CGG GCC GCG GCC GC A CTA GTG GGG ATC C 
Ncol Not i 

STOP Sfil PstI 

TT AAT (TAA( gGG CCA CTGGGG CCC CTC GAC CTG CAGCCA 
Pad 

Oligo 161 



AGC TAA TT |ccgggcgaatttcttat^ 
Oligo 160 5' GAGAGTAGTAACAAAGGTC 3 ' 
Oligo 161 5' CATAAGAAATTCGCCCGG 3 ' 



SUBSTITUTE SHEET (RULE 26) 
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pB13 



Alias: pGBT9NSl 
A pplication : 2HY (bait) 
Packbone : pGBT9 
Specificity : Sfi non- 



Ppu 101 5290 
ft/a I 5290 
flfrBI 5290 



851 Ml Cgnsmictedby CR 
651 Xho\ 
709 Hpa\ 
724 Bsr Gl 
B85 Afeel 
868 Wcol 
868 Sty] 
903 Af/i/1131 
903 Sac II 

905 AtoM 

906 &sl 




1440 fismFI 

1530 BstXI 
1604 fi»RV 
1545 Bsu 36! 
1747 Mfel 
1803 Xba I 



2730 *hvNI 



2171 Pvull 
2226 Sap\ 
2348 -asm 

2349 BspLUHI 



Oligo 160 



[ gagagtagtaacaaaggtcj AAAGACAGTTGACTGTATCGCCG GAA TTT ATG 
SfiT Sac II 



Spel 



G CC ATG GCC GCA GGG GCC GCG GCC GC A CTA GTG GGG ATC C 
Ncol No ti 

STOP Sfi I p s tl 

TT AAT (tAa| GGG CCA CTG GGG CCC CTC GAC CTG CAG CCA 
Pad 

Oligo 161 

AGC TAA TT [txgggcgaatttcttatgf 

Oligo 160 5' GAGAGTAGTAACAAAGGTC 3' 

Oligo 161 5' CATAAGAAATTCGCCCGG 3' FIGURE 4 
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Alias : pGBT9NS2 

2HY (bait) 
pGBT9 

... Sfi oriented 

Selection : ampicillin 
bp CR 




1440 Bsm Fl 

1590 BstXl 
1604 ECORV 
1S45 Bsu 36I 
1747 Mfel 
1803 X&a I 
1809 XpaBI 
1809 Ssf API 
929 Pml\ 
2171 Pvu II 
2226 Sap I 
2349 -Mill 
2349 asp LU1 11 



Oligo 160 



) gagagtagtaacaaaggtc| AAAGACAGTTGACTGTATCGCCG GAA TTT ATG 



Sfil 



Sac II 



Spel 



■ — "F*- * _nauiiu 

GCC ATG GCC GGA CGG GCC GCG GCC GC A CTA GTG GGG ATC C 
Ncol NotI 



Sfil 



Apal 



TT AAT [tAaI GGG CCA CTG GGG CCC CTC GAC CTG CAG CCA 



Oligo 161 

AGC TAA TT J ccgggcgaatttcttatgj 



FIGURE 5 
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pB20 



Constructed by: LP 



Ppu iOt 5451 
Nsil 5451 
BfrBI 5451 
/W387/7I 5445 
Ava I 5445 



161 flsfBI 

447 Hin ell 
447 Hpal 
486 Acf I 
517 Mfal 

759 flsrol 
957 Anel 
975 Sg/ll 
99BXcml 
1045 Eco Rl 
1053 Sffl • 
1O64 05fOSI 
1064 Af/i/1131 
1064 Sec H 

1066 AtoN 

1067 £a£7 1 



Alias: pLcxlONS2 
A pplication : 2HY (bait) 
Backbone : pLex 1 0 (pB9) 
Specificity : Sfi-oriented 




1964 Xba\ 
1970ApaBl 
1970 Bst API 
2090 Rm/I 
2332 Pvu II 
2387 Sap] 
2510 Ssp LU11I 



EcoRI 



Siil 



Not I 



Spel 



GAA TTC GGG GCC GGA CGG G CC GCG G CC GCA CTA GTG 
_ . SacU 

BamHI sTor 

GGG ATC CTT AAT [tAA| GGG CCA CTG GGG CC C CTC GAC 
Pad Sfil 

CTG CAG 

Pstl FIGURE 6 



SUBSTITUTE SHEET (RULE 26) 



WO 02/066501 



PCT/EP01/15428 



pPl 



flc/l 7932 
Spei 7917^ 
Xfeal 7533 



Alias : pACTIIst 

2HY(prey) 
pACTII 

Selection : ampicillin 




Ssa B647I 5778 
fls/EII 5522 



i GATGTTTAATACCACTACAATGGATGATGTATATAACrATCTATT 

Bgin 



1 

jCCCAAAAAAAGAGATCTGTATGGCTTACCCATACGATGTTCCAG 



Smal 



. BamHI 



ATTACGCTAGCTTGGGTGGTCATATGG CC ATG G AG GCC CCG GGG ATC CGA ATT 

SacI Xhoi Bgin 

CGA GCT CGA CTA OCT AGC TGA CTC GAG AGA TCT ATGAAT 



cgtagatactgaaaaacccc I GCAAGTT 



ABS1 5CGTTTGGAATCACTACAGG3' 

JC90 5 CGATGATGAAGATACCCGACCAAA3' 

162 5'GGGGTTTTTCAGTATCTACG3 , 

ABS2 5' CACGATGCACAGTTGAAGTG 3' 

53 5' GAAATTGAGATGGTGCACGATGCAC3' 



FIGURE 7 
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WO 02/066501 PCT/EP01/15428 



pP2 



i:2HY(prey) 
pACTTIst 
Specificity-Sfi non-oriented 
i: ampicillin 
.SW 




3100 S/7I- 
3111 M/i/1131 
3111 Sac II 
3129 SamHI 
3135 Pad 
3144 SSI • 
3152 4>a' 
3152 Sa/ill 



CG |cgtttggaatoactacagg| GATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATT 

JC90 BglD 

ajcCCAAAAAAAGAGATCTGTATGGCTTACCCATACGATGTTCCAG 



Sfi I Sac II 



ATTACGCTAGCTTGGGTGGTCATATGG CC atg GCC GCAGGG gcc GCG GCCGCA 
d „» „ Ncol 



CTA GTG GGG ATC CTT AAT|TAA|GGG CCA CTG GGG CCC CTC GAGAGA TCT 



ATGAAT jcgtagatactgaaaaaccccj GCAAGTT cacttcaactktgcatcgtg 



162 ABS2 
ABS1 5' CGTTTGGAATCACTACAGG 3' 
JC90 5' CGATG ATG AAGATACCCCACCAAA 3 ' 
162 5 GGGGTTTTTCAGTATCTACG3' 
ABS2 5' CACGATGCACAGTTGAAGTG3' 
53 5' GAAATTGAGATGGTGCACGATGCAC 3' 
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Bdl 7947 
Xbal 7548 



P-P3 Application; 2HY (prey) 

Backbone: pACIHst 
§E££ificity.Sfi oriented 
SgMiP.E: ampiciilin 
Constructed hy-KW 




Sse 86471 5793 
Est Ell SS37 
C/al 5521 
Age I 5422 



Afcol 
3100 Sffl • 
3111 Afu113l 
3111 Sac II 
3129 Bam HI 
3135 Pael 
3144 SBi • 
3152 

3152 San II 
3152 Bap 1201 
3158 Sdl 



ABS1 

CG lcgtttggaatcactacaggl GATGTTTAATACCACTACAATGGATCtATfiTATATA a TAT1 
JC90 



JCCCAAAAAAAGAGATCTGTATGGCTTACCCATACGATGTTCCAG 



ATTACGCTAGCTTGGGTGGTCATATGGCC_ATG_GCC ggacgg gcc gcg gccgca 

BamHI Pad Nco1 
CTA OTP OOP ATC C^S ^GOG CCA CTG GGG CCC CIC GAG AGA TCT 



cacttcaacl^tgcatcgtg 



ABS2 



caccatctcaatttc 



162 

ABS1 5'CGTTTGGAATCACTACAGG^ 
JC90 5' CGATGATGAAGATACCCCACCAAA 3' 
162 5' GGGGTTTTTCAGTATCTACG 3' 
ABS2 5 CACGATGCACAGITGAAGTG3' 
53 5' GAAATTGAGATGGTGCACGATGCAC 3' 

FIGURE 9 
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DP6 asB&ffi 

* 547 R 



flsrol 7120 



B34l\taol 

i:2HY(prey) 



flpi/1017091 a ^Lf\ Easfcbone: pGAD3S2X 

N90 r%% %\ ls®7& Specificity : Sfi non-oricntcd 

Stol 6945 BB2SBI- 



Wart 6945 
Kasl 6945 




SSS; Cop S tnic.edby:SW 

B90 Bsp120l 
896 Sa l 



3443 Sevl 



GATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATT 

JC90 

| cgatgatgaagataccccaccaaaf CCCAAAAAAAGAGATCCTAGAACTA 

Sfil Sa «=" Spel Bam HI 

G CC ATG GC C GCA GGG GCC GCG GC C OCA CTA GTG GGG ATC C 
Ncol NotI 

STOP SfiI Xhol XbaI 

TT AAT)TAA|GGG CCA CTG GGG CCC CTC GAG [TAG) C^JrG TCjTAG]\ 

STOP STOP STOP 

GGCCCGGTACCCAATTCGCCCTATAGTGAGTCGTATTACAATTCACTGGCCGTCGTTTTA 
CAA CGTCGTGACTGGGA AAACCCTGATCTATGAAT | cgtagatactgaaaaacccc | GCAA 
GTT i 



cacttcaacmtgcatcgtg 



caccatctcaatttctttc 



ABS2 53 
ABS1 5' CGTTTGGAATCACTACAGG 3' 
JC90 5' CGATGATGAAGATACCCCACCAAA3' 
162 5' GGGGTTTTTCAGTATCTACG3' 
ABS2 5 CACGATGCACAGTTGAAGTG3' 

53 5' GAAATTGAGATGGTGCACGATGCAC3' FIGURE 10 
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PGAD3S2XNS2 
2HY(prey) 
Backbone: pGAD3S2X 
Specificity: Sfi oriented 
ampicillin 




GATGTTTAATACCACTACAATGGATGATGTATATAACTATCTATT 



i| CCCAAAAAAAGAGATCCTAGAACTA 
; i Sac II 



Spel 



G CC ATG GC C GGA CGG G CC GCG O CT. GCA CTA GTG GGG ATC C 
Ncol pjotl 

Sfi I" Xhol Xb.I 

TT AAT |taaJ GGG CCA CTG GGG CCC CTC GAG EagJ drA~G>G T OJAGk 

stop stop stop 

ggcccggtacccaattcgccctatagtgagtcgtattacaattcactggccgtcgtttta 



caa cgtcgtgactgggaaaaccctgatc tatgaat [ 



4 



ABS2 



53 



ABS1 1 5' CGTTTGGAATCACTACAGG 3' 

iC99 5' CGATGATGAAGATACCCCACCAAA3 > 

162 5' GGGGTTTTTCAGTATCTACG 3* 

ABS2 5' CACGATGCACAGTTGAAGTG 3' 

53 5' GAAATTGAGATGGTGCACGATGCAC3' 
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FIGURE 12 
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WO 02/066501 
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pCmAHU 

pfecUVS 



oyaA' MCS cat Orip15A 



pT25 



cyaA' MCS cat Oripl5A 

*'' p sl ' , Bam HI < Kpn I 

gctgcagggtcgactctagaggatccccgggtacctaagiaaciaagaattc 
AAGSTLEDPRVPK stop 



pT18 




Kpnl 



tgggtaccgggccccccctcgaggtcgacggtatcgataagcttgatatcgaa"ttcc 
WVPGPPSRSTVSISLISNS 



FIGURE 14 
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a 

1/1 



FIGURE 15 
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Exemple of Protein Interaction Map 



FIGURE 16 
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+ SID1529 
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FIGURE 19 
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pP7-centro 



179 ApaB) 
179 Bst API 
183 Nde\ 
306 Pvu II 

396 Ecc Rl • 

402 Eel 13611 

402 Sac I 

419 Sph\ 

424 Sbf\ 

425 Psf I 
627 Wael 
627 A/goMIV 

736 Bsm I 



Wrt/I 6568 ' 
Pm/I 6452 
Swal 6428 



I 5721 
I 5721 
I 5721 
M 5662 



pP7-centro 



8922 base pairs 
Unique Sites 



3982 PpuM 
3982 Sse 8647I 
4128 BstEU 
4260 Bsm Fl 
4344 Age I 
4409 Afi II 



1962 Rsrll 
1998 Eco 47III 
2103 SexAl 
: 2132 M/ol 
I 2249 Nco I 
2266 Wo* I 
2282 Sam HI 
2305 Apa I 
2305 Bsp 1201 
2326 XJba I • 
2549 Msc I 



ATGGCCGgAcGGGCC GCGGCCGC ACTAGTGG GGATCC TTAATTAAGGG 
NotI BamHI 

C C ACTGGGGCCC CTC GAGTAGCTAGTGTCTAG AGGCCC GGTAC C C A ATT 



FIGURE 20 



